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Aleurain signal (ALE) Linker 




Hin dill (1245) Hin dill (1667) 

Generic 

ALE-TER Nco j (]252) mt i (1372) 
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Figure 9 



KDEL-COOH ER retention sequence 
Not I 

A A A K PLK DEL * 
1 GCGGCCGCGA AACCACTGAA GGATGAGCTG TAA 



Figure _J_0 



FAE-LINKER-FRAMESHIFT structure and sequence 



FAE LINKER ORF FRAME SH IFT 




FAE-LINKER-FRAMESHIFT 

48 ip 

+ 1GACT WPV AAA ETTE G 
SphI NotI 



1 GGCGCATGCA CCTGGCCGGT CGCGGCCGCG GAAACCACTG AAGGATGA 
CCGCGTACGT GGACCGGCCA GCGCCGGCGC CTTTGGTGAC TTCCTACT 
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Vectors 
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ER/VAC 
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Signal sequences 
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- aleurain-delNPIR -FAE 
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- aleurain-NPIR-FAE 

- aleurain-NPIR-FAE-linker-KDEL 

- aleurain-NPIR-FAE-linker-frameshift 

- aleurain-NPIR-FAE-linker-stop 

- Aspergillus signal -FAE-KDEL 
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Figure _]3 



ALEURAIN -NPIR (Vacuolar) and NPGR (Apoplast) structure and sequence 



NPIR UNDERLINE 
NPGR BOLD 



Xco\(9) NPIR 
ff//idll[(2) 0 R F NPGR AoM(129) 
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ALEURAIN-NPIR 

134 tp 



+ 1 M AHA RVL LLAL AVL ATA AVA 

Hindlll Ncol 



1 AAGCTTACCA TGGCCCACGC CCGCGTCCTC CTCCTGGCGC TCGCCGTGCT GGCCACGGCC GCCGTCGCCG 
TTCGAATGGT ACCGGGTGCG GGCGCAGGAG GAGGACCGCG AGCGGCACGA CCGGTGCCGG CGGCAGCGGC 

+ 1VASS SSF ADS N P I R P V T D R A A 

Not I 

71 TCGCCTCCTC CTCCTCCTTC GCCGACTCCA ACCCGATCCG GCCCGTCACC GACCGCGCGG CCGC 
AGCGGAGGAG GAGGAGGAAG CGGCTGAGGT TGGGCTAGGC CGGGCAGTGG CTGGCGCGCC GGCG 



Figure jL i 



RAT SIALYL TRANSFERASE Golgi signal sequence 

Hindlll 

M I H T NLK KKFS LFI LVF LLFA 
1 AAGCTTACCA TGATCCACAC CAACCTCAAA AAGAAGTTCT CCCTCTTCAT CCTCGTCTTC CTCCTCTTCG 

• VIC VWK KGSD YEA LTL QAKE F Q M • 
71 CCGTGATCTG CGTGTGGAAG AAGGGCTCCG ACTACGAGGC CCTCACCCTC CAAGCCAAGG AGTTCCAAAT 

NotI 



• A A 
141 GGCGGCCGC 



Figure JjT 



POTATO PROTEASE INHIBITOR II Apoplast signal sequence 

Hindi I I 

M XVH KEV NFVA YLL IVL GLLL 

1 AAGCTTACMA tggmcgtgca caaggaggts aacttcgtsg cctacctcct gatcgtsctc 

GGCCTCCTCT 

Ncol 



. LVS AME HVDA KAC TXE CGN 
G F G • 

71 TGCTCGTSTC CGCCATGGAG CACGTGGACG CCAAGGCCTG CaCCCKCGAG TGCGGCAACC 
TCGGCTTCGG 

NotI 

• I C P AAA 
141 CATCTGCCCG GCGGCCGCC 
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Figure 



Figure 3a B 



Ncol 



Hindlll 



M AHA RVL LLAL AVL ATA AVAV 
1 AAGCTTACCA TGGCCCACGC GCGCGTCCTC CTCCTGGCGC TCGCCGTGCT GGCCACGGCC GCCGTGGCCG 

NPIR NotI 



• ASS SSF ADSN PIR PVT DRAA AST 
71 TCGCCTCCTC CTCCTGCTTC GCCGACTCCA ACCCGATCCG GCCCGTCACC GACCGCGCGG CCGCCTCCAC 

* Q G I SEDL YSR LVE MATI SQA AYA 
141 GCAGGGCATC TCCGAAGACC TCTACAGCCG TTTAGTCGAA ATGGCCACTA TCTCCCAAGC TGCCTACGCC 

Sail 



DLCN IPS Til KGEK IYN SQT DING 
211 GACCTGTGCA ACATTCCGTC GACTATTATC AAGGGAGAGA AAATTTACAA TTCTCAAACT GACATTAACG 

BamHI 



- W I L RDD SSKE I I T VFR GTGS DTN 
281 GATGGATCCT CCGCGACGAC AGCAGCAAAG AAATAATCAC CGTCTTCCGT GGCACTGGTA GTGATACGAA 

Glycosylation 

* L Q L DTDY TLT PFD TLPQ CNG CEV 
3 51 TCTACAACTC GATACTGACT ACACCCTCAC GCCTTTCGAC ACCCTACCAC AATGCAACGG TTGTGAAGTA 

HGGY YIG WVS VQDQ VES LVK QQVS 
421 CACGGTGGAT ATTATATTGG ATGGGTCTCC GTCCAGGACC AAGTCGAGTC GCTTGTCAAA CAGCAGGTTA 

* QYP DYA LTVT GHX LGA SLAA LTA 
491 GCCAGTATCC GGACTACGCG CTGACCGTGA CCGGCCACKC CCTCGGCGCC TCCCTGGCGG CACTCACTGC 

* A Q L SATY DNI RLY TFGE PRS GNQ 
561 CGCCCAGCTG TCTGCGACAT ACGACAACAT CCGCCTGTAC ACCTTCGGCG AACCGCGCAG CGGCAATCAG 

Xhol 



AFAS YMN DAF QASS PDT TQY FRVT 
631 GCCTTCGCGT CGTACATGAA CGATGCCTTC CAAGCCTCGA GCCCAGATAC GACG CAGTAT TTCCGGGTCA 

NCOI 



• HAN DGI PNLP PVE QGY AHGG VEY 
701 CTCATGCCAA CGACGGCATC CCAAACCTGC CCCCGGTGGA GCAGGGGTAC GCCCATGGCG GTGTAGAGTA 

* W S V DPYS AQN TFV CTGD EVQ CCE 
771 CTGGAGCGTT GATCCTTACA GCGCCCAGAA CACATTTGTC TGCACTGGGG ATGAAGTGCA GTGCTGTGAG 

SphI 

AQGG QGV NNA HTTY FGM TSG ACTW 
841 GCCCAGGGCG GACAGGGTGT GAATAATGCG CACACGACTT ATTTTGGGAT GACGAGCGGC GCATGCACCT 



Figure 2^C 



Not I KDEL 



* PVAAAETTEG* 
911 GGCCGGTCGC GGCCGCGGAA ACCACTGAAG GATGAGCTGT AAAGAAGCAG ATCGTTCAAA CATTTGGCAA 
981 TAAAGTTTCT TAAGATTGAA TCCTGTTGCC GGTCTTGCGA TGATTAT CAT ATAATTTCTG TTGAATTACG 
1051 TTAAGCATGT AATAATTAAC ATGTAATG C A TGACGTTATT TATGAGATGG GTTTTTATGA TTAGAGTCCC 
1121 GGAATTATAC ATTTAATACG CGATAGAAAA CAAAATATAG CGCGCAAACT AGGATAAATT ATCGCGCGCG 

Hindi I I 



Clal Xbal 



1191 GTGTCATCTA TGTTACTAGA TCGATAAGCT TCTAGAGCGG CCGGTGGAGC TCCAATTCGC CCTATAGTGA 

12 61 GTCGTATTAC GCGCGCTCAC TGGCCGTCGT TTTACAACGT CGTGACTGGG AAAACCCTGG CGTTACCCAA 

13 31 CTTAATCGCC TTGCAGCACA TCCCCCTTTC GCCAGCTGGC GTAATAGCGA AGAGGCCCGC ACCGATCGCC 

14 01 CTTCCCAACA GTTGCGCAGC CTGAATGGCG AATGGGACGC GCCCTGTAGC GGCGCATTAA GCGCGGCGGG 
1471 TGTGGTGGTT ACGCGCAGCG TGACCGCTAC ACTTGCCAGC GCCCTAGCGC CCGCTCCTTT CGCTTTCTTC 
1541 CCTTCCTTTC TCGCCACGTT CGCCGGCTTT CCGCGTCAAG CTCTAAATCG GGGGCTCCCT TTAGGGTTCC 
1611 GATTTAGTGC TTTACGGCAC CTCGACCCCA AAAAACTTGA TTAGGGTGAT GGTTCACGTA GTGGGCCATC 
1681 GCCCTGATAG ACGGTTTTTC GCCCTTTGAC GTTGGAGTCC ACGTTCTTTA ATAGTGGACT CTTGTTCCAA 
1751 AC TGGAACAA CACTCAACCC TATCTCGGTC TATTCTTTTG ATTTATAAGG GATTTTGCCG ATTTCGGCCT 
1821 ATTGGTTAAA AAATGAGCTG ATTTAACAAA AATTTAACGC GAATTTTAAC AAAATATTAA CGCTTACAAT 
18 91 TTAGGTGGCA CTTTTCGGGG AAATGTGCGC GGAACCCCTA TTTGTTTATT TTTCTAAATA CATT CAAATA 
1961 TGTATCCGCT CATGAGACAA TAACCCTGAT AAATGCTTCA ATAATATTGA AAAAGGAAGA GTATGAGTAT 
2 031 TCAACATTTC CGTGTCGCCC TTATTCCCTT TTTTGCGGCA TTTTGCCTTC CTGTTTTTGC TCACCCAGAA 
2101 ACGCTGGTGA AAGTAAAAGA TGCTGAAGAT CAGTTGGGTG CACGAGTGGG TTACATCGAA CTGGATCTCA 
2171 AC AGC GGTAA GATCCTTGAG AGTTTTCGCC CCGAAGAACG TTTTCCAATG ATGAGCACTT TTAAAGTTCT 
2241 GCTATGTGGC GCGGTATTAT CCCGTATTGA CGCCGGGCAA GAGCAACTCG GTCGCCGCAT ACACTATTCT 
2311 CAGAATGACT TGGTTGAGTA CTCACCAGTC ACAGAAAAGC ATCTTACGGA TGGCATGACA GTAAGAGAAT 

23 81 TATGCAGTGC TGCCATAACC ATGAGTGATA ACACTGCGGC CAACTTACTT CTGACAACGA TCGGAGGACC 

24 51 GAAGGAGCTA ACCGCTTTTT TGCACAACAT GGGGGATCAT GTAACTCGCC TTGATCGTTG GGAACCGGAG 
2521 CTGAATGAAG C CATAC C AAA CGACGAGCGT GACACCACGA TGCCTGTAGC AATGGCAACA ACGTTGCGCA 
2591 AACTATTAAC TGGCGAACTA CTTACTCTAG CTTCCCGGCA ACAATTAATA GACTGGATGG AGGCGGATAA 
2 661 AGTTGCAGGA CCACTTCTGC GCTCGGCCCT TCCGGCTGGC TGGTTTATTG CTGATAAATC TGGAGCCGGT 
2 731 GAGCGTGGGT CTCGCGGTAT CATTGCAGCA CTGGGGCCAG ATGGTAAGCC CTCCCGTATC GTAGTTATCT 
2 801 ACACGACGGG GAGTCAGGCA ACTATGGATG AACGAAATAG ACAGATCGCT GAGATAGGTG CCTCACTGAT 
2 871 TAAGCATTGG TAACTGTCAG ACCAAGTTTA CTCATATATA CTTTAGATTG ATTTAAAACT TCATTTTTAA 

2 941 TTTAAAAGGA TCTAGGTGAA GATCCTTTTT GATAATCTCA TGACCAAAAT CCCTTAACGT GAGTTTTCGT 

3 011 TCCACTGAGC GTCAGACCCC GTAGAAAAGA TCAAAGGATC TTCTTGAGAT CCTTTTTTTC TGCGCGTAAT 
3 081 CTGCTGCTTG CAAACAAAAA AACCACCGCT ACCAGCGGTG GTTTGTTTGC CGGAT CAAGA GCTACCAACT 
3151 CTTTTTCCGA AGGTAACTGG CTTCAGCAGA GCGCAGATAC CAAATACTGT CCTTCTAGTG TAGCCGTAGT 
3221 TAGGCCACCA CTTCAAGAAC TCTGTAGCAC CGCCTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC 
32 91 TGCTGCCAGT GGCGATAAGT CGTGTCTTAC CGGGTTGGAC TCAAGACGAT AGTTACCGGA TAAGGCGCAG 
3 3 61 CGGTCGGGCT GAACGGGGGG TTCGTGCACA CAGCCCAGCT TGGAGCGAAC GACCTACACC GAACTGAGAT 
3 431 ACCTACAGCG TGAGCTATGA GAAAGCGCCA CGCTTCCCGA AGGGAGAAAG GCGGACAGGT ATCCGGTAAG 
35 01 CGGCAGGGTC GGAACAGGAG AGCGCACGAG GGAGCTTCCA GGGGGAAACG CCTGGTATCT TTATAGTCCT 
3571 GTCGGGTTTC GCCACCTCTG ACTTGAGCGT CGATTTTTGT GATGCTCGTC AGGGGGGCGG AGCCTATGGA 
3 641 AAAACGCCAG CAACGCGGCC TTTTTACGGT TCCTGGCCTT TTGCTGGCCT TTTGCTCACA TGTTCTTTCC 
3 711 TGCGTTATCC CCTGATTCTG TGGATAACCG TATTACCGCC TTTGAGTGAG CTGATACCGC TCGCCGCAGC 
3 781 CGAACGACCG AGCGCAGCGA GTCAGTGAGC GAGGAAGCGG AAGAGCGCCC AATACGCAAA CCGCCTCTCC 
3 851 CCGCGCGTTG GCCGATTCAT TAATGCAGCT GGCACGACAG GTTTCCCGAC TGGAAAGCGG GCAGTGAGCG 
3 921 CAACGCAATT AATGTGAGTT AGCTCACTCA TTAGGCACCC CAGGCTTTAC ACTTTATGCT TCCGGCTCGT 
3 991 ATGTTGTGTG GAATTGTGAG CGGATAACAA TTTCACACAG GAAACAGCTA TGACCATGAT TACGCCAAGC 



Figure 3^ D 

Kpnl 



Xhol 



4061 GCGCAATTAA CCCTCACTAA AGGGAACAAA 

4131 TGAGAAGAGA GTCGGGATAG TCCAAAATAA 

42 01 TTAAAAGGTG GTATAAGTAA AATATCGGTA 

4271 TTATAAAAAT TGAGGATGTT TTGTCGGTAC 

4341 TTCGCGATTT GGAAATGCAT ATCTGTATTT 

4411 GATTTGTATA AGAAAT AT C T TTAAAAAACC 
EcoRI 



44 81 TCAGGCGAAT TCCACAATGA ACAATAATAA 

4551 TTCTAGTAAA ATAAAAGATA AACTTAGACT 

4621 AAAGTGCTAT GCACGATCCA TAGCAAGCCC 

46 91 ACTGGCAAAT AGTCTCCACC CCCGGCACTA 

4761 AAAAAAAAAA AAGAAAGAAA AAAAAGAAAA 

48 31 AGCGAGGAGG ATCGCGAGCA GCGACGAGGC 

4 9 01 CACTATATAC ATACCCCCCC CTCTCCTCCC 

4971 CCCCCTCGCT GCCGGACGAC GAGCTCCTCC 

5041 CTCCTCTTTC TTTCTCCGTT TTTTTTTTCG 

5111 AGAGCGGCTT CGTCGCCCAG ATCGGTGCGC 
BamHI 



5181 GTCGGCCCGG ATCCTCGCGG GGAATGGGGC 
52 51 GAATTTGAAT CCCTCAGCAT TGTTCATCGG 
5321 GCGGAGCTTT TTTGTAGC 



AGCTGGGTAC CGGGCCCCCC CTCGAGGTCA TTCATATGCT 
AACAAAGGTA AGATTACCTG GTCAAAAGTG AAAACATCAG 
ATAAAAGGTG GCCCAAAGTG AAATTTACTC TTTTCTACTA 
TTTGATACGT CATTTTTGTA TGAATTGGTT TTTAAGTTTA 
GAGTCGGTTT TTAAGTTCGT TGCTTTTGTA AATACAGAGG 
CATATGCTAA TTTGACATAA TTTTTGAGAA AAATATATAT 



GATTAAAATA GCTTGCCCCC GTTGCAGCGA TGGGTATTTT 
CAAAACATTT ACAAAAACAA CCCCTAAAGT CCTAAAGCCC 
AGCCCAACCC AACCCAACCC AACCCACCCC AGTGCAGCCA 
TCACCGTGAG TTGTCCGCAC CAGCGCACGT CTCGCAGCCA 
AGAAAAACAG CAGGTGGGTC CGGGTCGTGG GGGCCGGAAA 
CCGGCCCTCC CTCCGCTTCC AAAGAAACGC CCCCCATCGC 
ATCCCCCCAA CCCTACCACC ACCACCACCA CCACCTCCTC 
CCCCTCCCCC TCCGCCGCCG CCGGTAACCA CCCCGCCCCT 
TCTCGGTCTC GATCTTTGGC CTTGGTAGTT TGGGTGGGCG 
GGGAGGGGCG GGATCTGGCG GCTGGCGTCT CCGGGCGTGA 
Bglll 



TCTCGGATGT AGATCTTCTT TCTTTCTTCT TTTTGTGGTA 
TAGTTTTTCT TTTCATGATT TGTGACAAAT GCAGCCTCGT 




Figure 31 A 



Figure ,33 B 

NCOl 



Hindi II 



M AHA RVL LLAL A V L ATA AVAV 
1 AAGCTTACCA TGGCCCACGC CCGCGTCCTC CTCCTGGCGC TCGCCGTGCT GGCCACGGCC GCCGTCGCCG 

Not I 



• ASS R A A ASTQ GIS EDL YSRL VEM 
71 TCGCCTCCTC CCGCGCGGCC GCCTCCACGC AGGGCATCTC CGAAGACCTC TACAGCCGTT TAGTCGAAAT 

Sail 



-ATI SQAA YAD LCN IPST I I K GEK 
141 GGCCACTATC TCCCAAGCTG CCTACGCCGA CCTGTGCAAC ATTCCGTCGA CTATTATCAA GGGAGAGAAA 

BamHI 



IYNS QTD ING WILR DDS SKE I I T V 
211 ATTTACAATT CTCAAACTGA CATTAACGGA TGGATCCTCC GCGACGACAG CAGCAAAGAA ATAATCACCG 
• FRG TGS DTNL QLD TNY TLTP FDT 

2 81 TCTTCCGTGG CACTGGTAGT GATACGAATC TACAACTCGA TACTAACTAC ACCCTCACGC CTTTCGACAC 

-LPQ CNGC EVH GGY YIGW VSV QDQ 

3 51 CCTACCACAA TGCAACGGTT GTGAAGTACA CGGTGGATAT TATATTGGAT GGGTCTCCGT CCAGGACCAA 

VESL VKQ QVS QYPD YAL T V T GHXL 
421 GTCGAGTCGC TTGTCAAACA GCAGGTTAGC CAGTATCCGG ACTACGCGCT GACCGTGACC GGCCACKCCC 
- GAS L A A LTAA QLS ATY DNIR LYT 

4 91 TCGGCGCCTC CCTGGCGGCA CTCACTGCCG CCCAGCTGTC TGCGACATAC GACAACATCC GCCTGTACAC 

Xhol 

■FGE PRSG NQA FAS YMND AFQ ASS 
561 CTTCGGCGAA CCGCGCAGCG GCAATCAGGC CTTCGCGTCG TACATGAACG ATGCCTTCCA AGCCTCGAGC 

PDTT QYF RVT HAND GIP NLP PVEQ 
631 CCAGATACGA CGCAGTATTT CCGGGTCACT CATGCCAACG ACGGCATCCC AAACCTGCCC CCGGTGGAGC 
NCOl 



GYA HGG VEYW SVD PYS AQNT FVC 
701 AGGGGTACGC CCATGGCGGT GTAGAGTACT GGAGCGTTGA TCCTTACAGC GCCCAGAACA CATTTGTCTG 

* T G D EVQC CEA QGG QGVN NAH TTY 
7 71 CACTGGGGAT GAAGTGCAGT GCTGTGAGGC CCAGGGCGGA CAGGGTGTGA ATAATGCGCA CACGACTTAT 

FGMT SGA CTW * 
841 TTTGGGATGA CGAGCGGAGC CTGTACATGG TGATCAGTCA TTTCAGCCTC CCCGAGTGTA CCAGGAAAGA 
911 TGGATGTCCT GGAGAGGGGG CCGCGTAACC A G TGAAGGAT GAGCTGTAAA GAAGGAGATC GTTCAAACAT 

9 81 TTGGCAATAA AGTTTCTTAA GATTGAATCC TGTTGCCGGT CTTGCGATGA TTATCATATA ATTTCTGTTG 
1051 AATTACGTTA AGCATGTAAT AATTAACATG TAATGCATGA CGTTATTTAT GAGATGGGTT TTTATGATTA 
1121 GAGTCCCGCA ATTATACATT TAATA CGCGA TAGAAAACAA AATATAGCGC GCAAACTAGG ATAAATTATC 

Hindlll 



Clal Xbal 



1191 GCGCGCGGTG TCATCTATGT TACTAGATCG ATAAGCTTCT AGAGCGGCCG GTGGAGCTCC AATTCGCCCT 

1261 ATAGTGAGTC GTATTACGCG CGCTCACTGG CCGTCGTTTT ACAACGTCGT GACT GGGAAA ACCCTGGCGT 

13 31 TACCCAACTT AATCGCCTTG CAGCACATCC CCCTTTCGCC AGCTGGCGTA ATAGCGAAGA GGCCCGCACC 
1401 GATCGCCCTT CCCAACAGTT GCGCAGCCTG AATGGCGAAT GGGACGCGCC CTGTAGCGGC GCATTAAGCG 

14 71 CGGCGGGTGT GGTGGTTACG CGCAGCGTGA CCGCTACACT TGCCAGCGCC CTAGCGCCCG CTCCTTTCGC 
1541 TTTCTTCCCT TCCTTTCTCG CCACGTTCGC CGGCTTTCCC CGTCAAGCTC TAAATCGGGG GCTCCCTTTA 
1611 GGGTTCCGAT TTAGTGCTTT ACGGCACCTC GACCCCAAAA AACTTGATTA GGGTGATGGT TCACGTAGTG 

16 81 GGCCATCGCC CTGATAGACG GTTTTTCGCC CTTTGACGTT GGAGTCCACG TTCTTTAATA GTGGACTCTT 

17 51 GTTCCAAACT GGAACAACAC TCAACCCTAT CTCGGTCTAT TCTTTTGATT TATAAGGGAT TTTGCCGATT 
1821 TCGGCCTATT GGTTAAAAAA TGAGCTGATT TAACAAAAAT TTAACGCGAA TTTTAACAAA ATATTAACGC 

18 91 TTACAATTTA GGTGGCACTT TTCGGGGAAA TGTGCGCGGA ACCCCTATTT GTTTATTTTT CTAAATACAT 
1961 TCAAATATGT ATCCGCTCAT GAGACAATAA CCCTGATAAA TGCTTCAATA ATATTGAAAA AGGAAGAGTA 



Figure 33 C 

2031 tgagtattca'acatttccgt gtcgccctta ttccCttttt tgcggcattt tgccttcctg tttttgctca 

2101 cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac gagtgggtta catcgaactg 

2171 gatctcaaca gcggtaagat ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta 

2241 aagttctgct atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca 

2311 ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg cat gac agta 

23 81 agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg 

2451 gaggaccgaa ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg atcgttggga 

2 521 ACCGGAGCTG AATGAAGCCA TACCAAACGA CGAGCGTGAC ACCACGATGC CTGTAGCAAT GGCAACAACG 

2 5 91 TTGCGCAAAC TATTAACTGG CGAACTACTT ACTCTAGCTT CCCGGCAACA ATTAATAGAC TGGATGGAGG 

2 6 61 CGGATAAAGT TGCAGGACCA CTTCTGCGCT CGGCCCTTCC GGCTGGCTGG TTTATTGCTG ATAAATCTGG 

2731 AGCCGGTGAG CGTGGGTCTC GCGGTATCAT TGCAGCACTG GGGCCAGATG GTAAGCCCTC CCGTATCGTA 

2 8 01 GTTATCTACA CGACGGGGAG TCAGGCAACT ATGGATGAAC GAAATAGACA GATCGCTGAG ATAGGTGCCT 

2 871 CACTGATTAA GCATTGGTAA CTGTCAGACC AAGTTTACTC ATATATACTT TAGATTGATT TAAAACTTCA 

2 941 TTTTTAATTT AAAAGGATCT AGGTGAAGAT CCTTTTTGAT AATCTCATGA CCAAAATCCC TTAACGTGAG 

3 011 TTTTCGTTCC ACTGAGCGTC AGACCCCGTA GAAAAGATCA AAGGATCTTC TTGAGATCCT TTTTTTCTGC 
3 0 81 GCGTAATCTG CTGCTTGCAA ACAAAAAAAC CACCGCTACC AGCGGTGGTT TGTTTGCCGG AT CAAGAG CT 
3151 ACCAACTCTT TTTCCGAAGG TAACTGGCTT CAGCAGAGCG CAGATACCAA ATACTGTCCT TCTAGTGTAG 
3221 CCGTAGTTAG GCCACCACTT CAAGAACTCT GTAGCACCGC GTACATACCT CGCTCTGCTA ATCCTGTTAC 
3 2 91 CAGTGGCTGG TGCCAGTGGC GATAAGTCGT GTCTTACCGG GTTGGACTCA AGACGATAGT TAC CGGATAA 
33 61 GGCGCAGCGG TCGGGCTGAA CGGGGGGTTC GTGCACACAG CCCAGCTTGG AGCGAACGAC CTACACCGAA 
3431 CTGAGATACC TACAGCGTGA GCTATGAGAA AGCGCCACGC TTCCCGAAGG GAGAAAGGCG GACAGGTATC 
3 5 01 CGGTAAGCGG CAGGGTCGGA ACAGGAGAGC GCACGAGGGA GCTTCCAGGG GGAAACGCCT GGTATCTTTA 
3 5 71 TAGTCCTGTC GGGTTTCGCC ACCTCTGACT TGAGCGTCGA TTTTTGTGAT GCTCGTCAGG GGGGCGGAGC 
3 641 CTATGGAAAA ACGCCAGCAA CGCGGCCTTT TTACGGTTCC TGGCCTTTTG CTGGCCTTTT GCTCACATGT 
3 711 TCTTTCCTGC GTTATCCCCT GATTCTGTGG ATAACCGTAT TACCGCCTTT GAGTGAGCTG ATACCGCTCG 
3 7 81 CCGCAGCCGA ACGAC CGAGC GCAGCGAGTC AGTGAGCGAG GAAGCGGAAG AGCGCCCAAT ACGCAAACCG 
3851 CCTCTCCCCG CGCGTTGGCC GATTCATTAA TGCAGCTGGC ACGACAGGTT TCCCGACTGG AAAGCGGGCA 
3 921 GTGAGCGCAA CGCAATTAAT GTGAGTTAGC TCACTCATTA GGCACCCCAG GCTTTACACT TTATGCTTCC 
3 9 91 GGCTCGTATG TTGTGTGGAA TTGTGAGCGG ATAACAATTT CACACAGGAA ACAGCTATGA CCATGATTAC 

Kpnl Xhol 



4 0 61 GCCAAGCGCG CAATTAACCC TCACTAAAGG GAACAAAAGC TGGGTACCGG GCCCCCCCTC GAGGTCATTC 

4131 ATATGCTTGA GAAGAGAGTC GGGATAGTCC AAAATAAAAC AAAGGTAAGA TTACCTGGTC AAAAGTGAAA 

42 01 ACATCAGTTA AAAGGTGGTA TAAGTAAAAT AT CGGTAATA AAAGGTGGCC CAAAGTGAAA TTTACTCTTT 

42 71 TCTACTATTA TAAAAATTGA GGATGTTTTG TCGGTACTTT GATACGTCAT TTTTGTATGA ATTGGTTTTT 

4341 AAGTTTATTC GCGATTTGGA AATGCATATC TGTATTTGAG TCGGTTTTTA AGTTCGTTGC TTTTGTAAAT 

4411 ACAGAGGGAT TTGTATAAGA AATATCTTTA AAAAACCCAT ATGCTAATTT GACATAATTT TTGAGAAAAA 

EcoRI 



44 81 TATATATTCA GGCGAATTCC ACAATGAACA ATAATAAGAT TAAAATAGCT TGCCCCCGTT GCAGCGATGG 

45 51 GTATTTTTTC TAGTAAAATA AAAGATAAAC TTAGACTCAA AACATTTACA AAAACAACCC CTAAAGTCCT 
4621 AAAGCCCAAA GTGCTATGCA CGATCCATAG CAAGCCCAGC CCAACCCAAC CCAACCCAAC CCACCCCAGT 

46 91 GCAGCCAACT GGCAAATAGT CTCCACCCCC GGCACTATCA CCGTGAGTTG TCCGCACCAC CGCACGTCTC 
4761 GCAGCCAAAA AAAAAAAAAG AAAGAAAAAA AAGAAAAAGA AAAACAGCAG GTGGGTCCGG GTCGTGGGGG 
4 831 CCGGAAAAGC GAGGAGGATC GCGAGCAGCG ACGAGGCCCG GCCCTCCCTC CGCTTCCAAA GAAACGCCCC 
4 901 CCATCGCCAC TATATACATA CCCCCCCCTC TCCTCCCATC CCCCCAACCC TACCACCACC ACCACCACCA 

4 971 CCTCCTCCCC CCTCGCTGCC GGACGACGAG CTCCTCCCCC CTCCCCCTCC GCCGCCGCCG GTAACCACCC 

5 041 CGCCCCTCTC CTCTTTCTTT CTCCGTTTTT TTTTTCGTCT CGGTCTCGAT CTTTGGCCTT GGTAGTTTGG 
5111 GTGGGCGAGA GCGGCTTCGT CGCCCAGATC GGTGCGCGGG AGGGGCGGGA TCTCGCGGCT GGCGTCTCCG 

BamHI Bglll 



5181 GGCGTGAGTC GGCCCGGATC CTCGCGGGGA ATGGGGCTCT CGGATGTAGA TCTTCTTTCT TTCTTCTTTT 
5251 TGTGGTAGAA TTTGAATCCC TCAGCATTGT TCATCGGTAG TTTTTCTTTT CATGATTTGT GACAAATGCA 
5321 GCCTCGTGCG GAGCTTTTTT GTAGC 




Figure 3^ A 



Figure 3 C / B 



NCOl 

Hindlll 



M AHA RVL LLAL A V L ATA AVAV 
AAGCTTACCA TGGCCCACGC CCGCGTCCTC CTCCTGGCGC TCGCCGTGCT GGCCACGGCC GCCGTCGCCG 

Not I 



. ASS SSF ADSN PIR PVT DRAA AST 
71 TCGCCTCCTC CTCCTCCTTC GCCGACTCCA ACCCGATCCG GCCCGTCACC GACCGCGCGG CCGCCTCCAC 
. Q G I SEDL Y S R LVE MATI SQA AYA 
141 GCAGGGCATC TCCGAAGACC TCTACAGCCG TTTAGTCGAA ATGGCCACTA TCTCCCAAGC TGCCTACGCC 

Sail 



DLCN IPS Til KGEK IYN SQT DING 
211 GACCTGTGCA ACATTCCGTC GACTATTATC AAGGGAGAGA AAATTTACAA TTCTCAAACT GACATTAACG 
BamHI 



- WIL RDD SSKE I I T VFR GTGS D T N • 
281 GATGGATCCT CCGCGACGAC AGCAGCAAAG AAATAATCAC CGTCTTCCGT GGCACTGGTA GTGATACGAA 

. L Q L DTNY TLT PFD TLPQ CNG CEV 
351 TCTACAACTC GATACTAACT ACACCCTCAC GCCTTTCGAC ACCCTACCAC AATGCAACGG TTGTGAAGTA 

HGGY YIG WVS VQDQ VES LVK QQVS 
421 CACGGTGGAT ATTATATTGG ATGGGTCTCC GTCCAGGACC AAGTCGAGTC GCTTGTCAAA CAGCAGGTTA 

• QYP DYA LTVT GHX LGA SLAA L T A ■ 
4 91 GCCAGTATCC GGACTACGCG CTGACCGTGA CCGGCCACKC CCTCGGCGCC TCCCTGGCGG CACTCACTGC 

. A Q L SATY DNI RLY TFGE PRS GNQ 
561 CGCCCAGCTG TCTGCGACAT ACGACAACAT CCGCCTGTAC ACCTTCGGCG AACCGCGCAG CGGCAATCAG 

Xhol 



AFAS YMN DAF QASS PDT TQY FRVT 
631 GCCTTCGCGT CGTACATGAA CGATGCCTTC CAAGCCTCGA GCCCAGATAC GACGCAGTAT TTCCGGGTCA 

NCOl 



• HAN DGI PNLP PVE QGY AHGG V E Y - 
701 CTCATGCCAA CGACGGCATC CCAAACCTGC CCCCGGTGGA GCAGGGGTAC GCCCATGGCG GTGTAGAGTA 

• WSV DPYS AQN TFV CTGD EVQ CCE 
771 CTGGAGCGTT GATCCTTACA GCGCCCAGAA CACATTTGTC TGCACTGGGG ATGAAGTGCA GTGCTGTGAG 

SphI 

AQGG QGV NNA HTTY FGM TSG ACTW 
841 GCCCAGGGCG GACAGGGTGT GAATAATGCG CACACGACTT ATTTTGGGAT GACGAGCGGC GCATGCACCT 
NotI 



* PVA A A E PLKD EL* 
911 GGCCGGTCGC GGCCGCGGAA CCACTGAAGG ATGAGCTGTA AAGAAGCAGA TCGTTCAAAC ATTTGGCAAT 
981 AAAGTTTCTT AAGATTGAAT CCTGTTGCCG GTCTTGCGAT GATTATCATA TAATTTCTGT TGAATTACGT 
1051 TAAGCATGTA ATAATTAACA TGTAATGCAT GACGTTATTT ATGAGATGGG TTTTTATGAT TAGAGTCCCG 
1121 CAATTATACA TTTAATACGC G AT AGAAAA C AAAATATAGC GCGCAAACTA GGATAAATTA TCGCGCGCGG 

Hindlll 



Clal Xbal 



1191 TGTCATCTAT GTTACTAGAT CGATAAGCTT CTAGAGCGGC CGGTGGAGCT CCAATTCGCC CTATAGTGAG 

1261 TCGTATTACG CGCGCTCACT GGCCGTCGTT TTACAACGTC GTGACTGGGA AAACCCTGGC GTTACCCAAC 

1331 TTAATCGCCT TGCAGCACAT CCCCCTTTCG CCAGCTGGCG TAATAGCGAA GAGGCCCGCA CCGATCGCCC 

1401 TTCCCAACAG TTGCGCAGCC TGAATGGCGA ATGGGACGCG CCCTGTAGCG GCGCATTAAG CGCGGCGGGT 

1471 GTGGTGGTTA CGCGCAGCGT GACCGCTACA CTTGCCAGCG CCCTAGCGCC CGCTCCTTTC GCTTTCTTCC 

1541 CTTCCTTTCT CGCCACGTTC GCCGGCTTTC CCCGTCAAGC TCTAAATCGG GGGCTCCCTT TAGGGTTCCG 

1611 ATTTAGTGCT TTACGGCACC TCGACCCCAA AAAACTTGAT TAGGGTGATG GTTCACGTAG TGGGCCATCG 



Figure 39 C 

16 81 CCCTGATAGA CGGTTTTTCG CCCTTTGACG TTGGAGTCCA CGTTCTTTAA TAGTGGACTC TTGTTCCAAA 

1751 CTGGAACAAC ACTCAACCCT ATCTCGGTCT ATTCTTTTGA TTTATAAGGG ATTTTGCCGA TTTCGGCCTA 

1821 TTGGTTAAAA AATGAGCTGA TTTAACAAAA ATTTAACGCG AATTTTAACA AAATATTAAC GCTTACAATT 

1891 TAGGTGGCAC TTTTCGGGGA AATGTGCGCG GAACCCCTAT TTGTTTATTT TTCTAAATAC ATTCAAATAT 

1961 GTATCGGCTC ATGAGACAAT AACCCTGATA AATGCTTCAA TAATATTGAA AAAGGAAGAG TATGAGTATT 

2 031 CAACATTTCC GTGTCGCCCT TATTCCCTTT TTTGCGGCAT TTTGCCTTCC TGTTTTTGCT CACCCAGAAA 

2101 CGCTGGTGAA AGTAAAAGAT GCTGAAGATC AGTTGGGTGC ACGAGTGGGT TACATCGAAC TGGATCTCAA 

2171 CAGCGGTAAG ATCCTTGAGA GTTTTCGCCC CGAAGAACGT TTTCCAATGA TGAGCACTTT TAAAGTTCTG 

2241 CTATGTGGCG CGGTATTATC CCGTATTGAC GCCGGGCAAG AGCAACTCGG TCGCCGCATA CACTATTCTC 

2311 AGAATGACTT GGTTGAGTAC TCACCAGTCA CAGAAAAGCA TCTTACGGAT GGCATGACAG TAAGAGAATT 

23 81 ATGCAGTGCT GCCATAACCA TGAGTGATAA CACTGCGGCC AACTTACTTC TGACAACGAT CGGAGGACCG 

2451 AAGGAGCTAA CCGCTTTTTT GCACAACATG GGGGATCATG TAACTCGCCT TGATCGTTGG GAACCGGAGC 

2 521 TGAATGAAGC CATACCAAAC GACGAGCGTG ACACCACGAT GCCTGTAGCA ATGGCAACAA CGTTGCGCAA 

2 5 91 ACTATTAACT GGCGAACTAC TTACTCTAGC TTCCCGGCAA CAATTAATAG ACTGGATGGA GGCGGATAAA 

2 6 61 GTTGCAGGAC CACTTCTGCG CTCGGCCCTT CGGGCTGGCT GGTTTATTGC TGATAAATCT GGAGCCGGTG 

2 731 AGCGTGGGTC TCGCGGTATC ATTGCAGCAC TGGGGCCAGA TGGTAAGCCC TCCCGTATCG TAGTTATCTA 

2 8 01 CACGACGGGG AGTCAGGCAA C TAT GGATGA ACGAAATAGA CAGAT CGCTG AGATAGGTGC CTCACTGATT 

2 8 71 AAGCATTGGT AACTGTCAGA CCAAGTTTAC TCATATATAC TTTAGATTGA TTTAAAACTT CATTTTTAAT 

2 941 TTAAAAGGAT CTAGGTGAAG ATCCTTTTTG ATAATCTCAT GACCAAAATC CCTTAACGTG AGTTTTCGTT 

3 011 CCACTGAGCG TCAGACCCCG TAGAAAAGAT CAAAGGATCT TCTTGAGATC CTTTTTTTCT GCGCGTAATC 
3 081 TGCTGCTTGC AAACAAAAAA ACCACCGCTA CCAGCGGTGG TTTGTTTGCC GGATCAAGAG CTACCAACTC 
3151 TTTTTCCGAA GGTAACTGGC TTCAGCAGAG CGCAGATACC AAATACTGTC CTTCTAGTGT AGCCGTAGTT 
3221 AGGCCACCAC TTCAAGAACT CTGTAGCACC GCCTACATAC CTCGCTCTGC TAATCCTGTT AC C AGTGGC T 
32 91 GCTGCCAGTG GCGATAAGTC GTGTCTTACC GGGTTGGACT CAAGACGATA GTTACCGGAT AAGGCGCAGC 
3 3 61 GGTCGGGCTG AACGGGGGGT TCGTGCACAC AGCCCAGCTT GGAGCGAACG ACCTACACCG AACTGAGATA 
3 431 CCTACAGCGT GAGCTATGAG AAAGCGCCAC GCTTCCCGAA GGGAGAAAGG CGGACAGGTA TCCGGTAAGC 
3 5 01 GGCAGGGTCG GAACAGGAGA GCGCACGAGG GAGCTTCCAG GGGGAAACGC CTGGTATCTT TATAGTCCTG 
3 5 71 TCGGGTTTCG CCACCTGTGA CTTGAGCGTC GATTTTTGTG ATGCTCGTCA GGGGGGCGGA GCCTATGGAA 
3 641 AAACGCCAGC AACGCGGCCT TTTTACGGTT CCTGGCCTTT TGCTGGCCTT TTGCTCACAT GTTCTTTCCT 
3 711 GCGTTATCCC CTGATTCTGT GGATAACCGT ATTACCGCCT TTGAGTGAGC TGATACCGCT CGCCGCAGCC 
3 7 81 GAACGACCGA GCGCAGCGAG TCAGTGAGCG AGGAAGCGGA AGAGCGCCCA ATACGCAAAC CGCCTCTCCC 
3 851 CGCGCGTTGG CCGATTCATT AATGCAGCTG GCAC GACAGG TTTCCCGACT GGAAAGCGGG CAGTGAGCGC 
3 921 AACGCAATTA ATGTGAGTTA GCTCACTCAT TAGGCACCCC AGGCTTTACA CTTTATGCTT CCGGCTCGTA 
3 9 91 TGTTGTGTGG AATTGTGAGC GGATAACAAT TTCACACAGG AAACAGCTAT GACCATGATT ACGCCAAGCG 

Kpnl Xhol 



4 061 CGCAATTAAC CCTCACTAAA GGGAACAAAA GCTGGGTACC GGGCCCCCCC TCGAGGTCAT TCATATGCTT 

4131 GAGAAGAGAG TCGGGATAGT CCAAAATAAA ACAAAGGTAA GATTACCTGG TCAAAAGTGA AAACATCAGT 

42 01 TAAAAGGTGG TATAAGTAAA ATATCGGTAA TAAAAGGTGG CCCAAAGTGA AATTTACTCT TTTCTACTAT 

42 71 TATAAAAATT GAGGATGTTT TGTCGGTACT TTGATACGTC ATTTTTGTAT GAATTGGTTT TTAAGTTTAT 

4341 TCGCGATTTG GAAATGCATA TCTGTATTTG AGTCGGTTTT TAAGTTCGTT GCTTTTGTAA ATACAGAGGG 

4411 ATTTGTATAA GAAATATCTT TAAAAAACCC ATATGCTAAT TTGACATAAT TTTTGAGAAA AATATATATT 
EcoRI 



44 81 CAGGCGAATT CCACAATGAA CAATAATAAG ATTAAAATAG CTTGCCCCCG TTGCAGCGAT GGGTATTTTT 

4551 TCTAGTAAAA TAAAAGATAA ACTTAGACTC AAAACATTTA CAAAAACAAC CCCTAAAGTC CTAAAGCCCA 

4621 AAGTGCTATG CACGATCCAT AGCAAGCCCA GCCCAACCCA ACCCAACCCA ACCCACCCCA GTGCAGCCAA 

46 91 CTGGCAAATA GTCTCCACCC CCGGCACTAT CACCGTGAGT TGTCCGCACC ACCGCACGTC TCGCAGCCAA 

4 761 AAAAAAAAAA AGAAAGAAAA AAAAGAAAAA GAAAAACAGC AGGTGGGTCG GGGTCGTGGG GGCCGGAAAA 

4 831 GCGAGGAGGA TCGCGAGCAG CGACGAGGCC CGGCCCTCCC TCCGCTTCCA AAGAAACGCC CCCCATCGCC 

4 901 ACTATATACA TACCCCCCCC TCTCCTCCCA TCCCCCCAAC CCTACCACCA CCACCACCAC CACCTCCTCC 

4 971 CCCCTCGCTG CCGGACGACG AGCTCCTCCC CCCTCCCCCT CCGCCGCCGC CGGTAACCAC CCCGCCCCTC 

5 041 TCCTCTTTCT TTCTCCGTTT TTTTTTTCGT CTCGGTCTCG ATCTTTGGCC TTGGTAGTTT GGGTGGGCGA 
5111 GAGCGGCTTC GTCGCCCAGA TCGGTGCGCG GGAGGGGCGG GATCTCGCGG CTGGCGTCTC CGGGCGTGAG 

BamHI Bglll 



5181 TCGGCCCGGA TCCTCGCGGG GAATGGGGCT CTCGGATGTA GATCTTCTTT CTTTCTTCTT TTTGTGGTAG 
5251 AATTTGAATC CCTCAGCATT GTTCATCGGT AGTTTTTCTT TTCATGATTT GTGACAAATG CAGCCTCGTG 
5321 CGGAGCTTTT TTGTAGC 




Figure 2T_A 



Figure 



Ncol 

BamHI 

M AHAR VLL LAL A V L A T A A • 
1 CCTGACGCCG AGGATCCATG GCCCACGCCC GCGTCGTCCT CCTGGCGCTC GCCGTGCTGG CCACGGCCGC 

Not I 

-VAV ASSS SFA DSN PGRP VTD RAA 
71 CGTCGCCGTC GCCTCCTCCT CCTCCTTCGC CGACTCCAAC QCGGGCCGGC CCGTCACCGA CCGCGCGGCC 
Not I 

ASTQ GIS EDL YSRL VEM ATI SQAA 
141 GCCTCCACGC AGGGCATCTC CGAAGACCTC TACAGCCGTT TAGTCGAAAT GGCCACTATC TCCCAAGCTG 

Sail 

Y A D LCN IPST I I K GEK IYNS QTD* 
211 CCTACGCCGA CCTGTGCAAC ATTCCGTCGA CTATTATCAA GGGAGAGAAA ATTTACAATT CTCAAACTGA 
BamHI 



• I N G WILR DDS SKE I I T V FRG TGS 

2 81 CATTAACGGA TGGATCCTCC GCGACGACAG CAGCAAAGAA ATAATCACCG TCTTCCGTGG CACTGGTAGT 

DTNL QLD TNY TLTP FDT LPQ CNGC 

3 51 GATACGAATC TACAACTCGA TACTAACTAC ACCCTCACGC CTTTCGACAC CCTACCACAA TGCAACGGTT 

• EVH GGY YIGW VSV QDQ VESL V K Q • 
421 GTGAAGTACA CGGTGGATAT TATATTGGAT GGGTCTCCGT CCAGGACCAA GTCGAGTCGC TTGTCAAACA 

• Q V S QYPD YAL TVT GHXL GAS LAA 

4 91 GCAGGTTAGC CAGTATCCGG ACTACGCGCT GACCGTGACC GGCCACKCCC TCGGCGCCTC CCTGGCGGCA 

LTAA QLS ATY DNIR LYT FGE PRSG 
561 CTCACTGCCG CCCAGCTGTC TGCGACATAC GACAACATCC GCCTGTACAC CTTCGGCGAA CCGCGCAGCG 

Xhol 



• NQA FAS YMND AFQ ASS PDTT Q Y F • 
631 GCAATCAGGC CTTCGCGTCG TACATGAACG ATGCCTTCCA AGCCTCGAGC CCAGATACGA CGCAGTATTT 

Ncol 

• R V T HAND GIP NLP PVEQ GYA HGG 
7 01 CCGGGTCACT CATGCCAACG ACGGCATCCC AAACCTGCCC CCGGTGGAGC AGGGGTACGC CCATGGCGGT 

VEYW SVD PYS AQNT FVC TGD EVQC 
771 GTAGAGTACT GGAGCGTTGA TCCTTACAGC GCCCAGAACA CATTTGTCTG CACTGGGGAT GAAGTGCAGT 

• CEA QGG QGVN NAH TTY FGMT S G A ■ 
841 GCTGTGAGGC CCAGGGCGGA CAGGGTGTGA ATAATGCGCA CACGACTTAT TTTGGGATGA CGAGCGGAGC 

• C T W * 

911 CTGTACATGG TGATCAGTCA TTTCAGCCTC CCCGAGTGTA CCAGGAAAGA TGGATGTCCT GGAGAGGGGG 
981 CCGCGTAACC ACTGAAGGAT GAGCTGTAAA GAAGCAGATC GTTCAAACAT TTGGCAATAA AGTTTCTTAA 
1051 GATTGAATCC TGTTGCCGGT CTTGCGATGA T TAT CAT ATA ATTTCTGTTG AATTACGTTA AGCATGTAAT 
1121 AATTAACATG TAAT GCATGA CGTTATTTAT GAGATGGGTT TTTATGATTA GAGTCCCGCA ATTATACATT 
1191 TAATACGGGA TAGAAAACAA AATATAGCGC GCAAACTAGG ATAAATTATC GCGCGCGGTG TCATCTATGT 

Hindlll 



Clal Xbal 



12 61 TACTAGATCG ATAAGCTTCT AGAGCGGCCG GTGGAGCTCC AATTCGCCCT ATAGTGAGTC GTATTACGCG 

13 31 CGCTCACTGG CCGTCGTTTT ACAACGTCGT GACTGGGAAA ACCCTGGCGT TACCCAACTT AATCGCCTTG 
1401 CAGCACATCC CCCTTTCGCC AGCTGGCGTA ATAGCGAAGA GGCCCGCACC GATCGCCCTT CCCAACAGTT 

14 71 GCGCAGCCTG AATGGCGAAT GGGACGCGCC CTGTAGCGGC GCATTAAGCG CGGCGGGTGT GGTGGTTACG 
1541 CGCAGCGTGA CCGCTACACT TGCCAGCGCC CTAGCGCCCG CTCCTTTCGC TTTCTTCCCT TCCTTTCTCG 
1611 CCACGTTCGC CGGCTTTCCC CGTCAAGCTC TAAATCGGGG GCTCCCTTTA GGGTTCCGAT TTAGTGCTTT 
1681 ACGGCACCTC GACCCCAAAA AACTTGATTA GGGTGATGGT TCACGTAGTG GGCCATCGCC CTGATAGACG 
1751 GTTTTTCGCC CTTTGACGTT GGAGTCCACG TTCTTTAATA GTGGACTCTT GTTCCAAACT GGAACAACAC 



Figure 3T~ C 



1821 TCAACCCTAT CTCGGTCTAT TCTTTTGATT TATAAGGGAT TTTGCCGATT TCGGCCTATT GGTTAAAAAA 

18 91 TGAGCTGATT TAACAAAAAT TTAACGCGAA TTTTAACAAA ATATTAACGC TTACAATTTA GGTGGCACTT 

1961 TTCGGGGAAA TGTGCGCGGA ACCCCTATTT GTTTATTTTT CTAAATACAT TCAAATATGT ATCCGCTCAT 

2 031 GAGACAATAA CCCTGATAAA TGCTTCAATA ATATTGAAAA AGGAAGAGTA TGAGTATTCA ACATTTCCGT 

2101 GTCGCCCTTA TTCCCTTTTT TGCGGCATTT TGCCTTCCTG TTTTTGCTCA CCCAGAAACG CTGGTGAAAG 

2171 TAAAAGATGC TGAAGATCAG TTGGGTGCAC GAGTGGGTTA CATCGAACTG GATCTCAACA GCGGTAAGAT 

2241 CCTTGAGAGT TTTCGCCCCG AAGAACGTTT TCCAATGATG AGCACTTTTA AAGTTCTGCT ATGTGGCGCG 

2 311 GTATTATCCC GTATTGACGC CGGGCAAGAG CAACTCGGTC GCCGCATACA CTATTCTCAG AATGACTTGG 

23 81 TTGAGTACTC ACCAGTCACA GAAAAGCATC TTACGGATGG CATGACAGTA AGAGAATTAT GCAGTGCTGC 

24 51 CATAACCATG AGTGATAACA CTGCGGCCAA CTTACTTCTG ACAACGATCG GAGGACCGAA GGAGCTAACC 
2 521 GCTTTTTTGC ACAACATGGG GGATGATGTA ACTCGCCTTG ATCGTTGGGA ACCGGAGCTG AATGAAGCCA 
2 5 91 TACCAAACGA CGAGCGTGAC ACCACGATGC CTGTAGCAAT GGCAACAACG TTGCGCAAAC TATTAACTGG 
2 661 CGAACTACTT ACTCTAGCTT CCCGGCAACA ATTAATAGAC TGGATGGAGG CGGATAAAGT TGCAGGACCA 
2731 GTTCTGCGCT CGGCCCTTCC GGCTGGCTGG TTTATTGCTG ATAAATCTGG AGCCGGTGAG CGTGGGTCTC 
2 8 01 GCGGT AT CAT TGCAGCACTG GGGCCAGATG GTAAGCCGTC CCGTATCGTA GTTATCTACA CGACGGGGAG 
2 8 71 TCAGGCAACT ATGGATGAAC GAAATAGACA GATCGCTGAG ATAGGTGCCT CACTGATTAA GCATTGGTAA 

2 941 CTGTCAGACC AAGTTTACTC ATATATACTT TAGATTGATT TAAAACTTCA TTTTTAATTT AAAAGGATCT 

3 011 AGGT GAAGAT CCTTTTTGAT AATCTCATGA CCAAAATCCC TTAACGTGAG TTTTCGTTCC ACTGAGCGTC 
3 081 AGACCCCGTA GAAAAGATCA AAGGATCTTC TTGAGATCCT TTTTTTCTGC GCGTAATCTG CTGCTTGCAA 
3151 ACAAAAAAAC CACCGCTACC AGCGGTGGTT TGTTTGCCGG ATCAAGAGCT ACCAACTCTT TTTCCGAAGG 
3221 TAACTGGCTT CAGCAGAGCG CAGATACCAA ATACTGTCCT T C TAG TGTAG CCGTAGTTAG GCCACCACTT 
32 91 CAAGAACTCT GTAGCACCGC CTACATACCT CGCTCTGCTA ATCCTGTTAC CAGTGGCTGC TGCCAGTGGC 
3 3 61 GATAAGTCGT GTCTTACCGG GTTGGACTCA AGACGATAGT TAGCGGATAA GGCGCAGCGG TCGGGCTGAA 
3431 CGGGGGGTTC GTGCACACAG CCCAGCTTGG AGCGAACGAC CTACACCGAA CTGAGATACC TACAGCGTGA 
3 5 01 GCTATGAGAA AGCGCCACGC TTCCCGAAGG GAGAAAGGCG GACAGGTATC CGGTAAGCGG CAGGGTCGGA 
3 571 ACAGGAGAGC GCACGAGGGA GCTTCCAGGG GGAAACGCCT GGTATCTTTA TAGTCCTGTC GGGTTTCGCC 
3 641 ACCTCTGACT TGAGCGTCGA TTTTTGTGAT GCTCGTCAGG GGGGCGGAGC CTATGGAAAA ACGCCAGCAA 
3 711 CGCGGCCTTT TTACGGTTCC TGGCCTTTTG CTGGCCTTTT GCTCACATGT TCTTTCCTGC GTTATCCCCT 
3 781 GATTCTGTGG ATAACCGTAT TACCGCCTTT GAGTGAGCTG ATACCGCTCG CCGCAGCCGA ACGACCGAGC 
3 851 GCAGCGAGTC AGTGAGCGAG GAAGCGGAAG AGCGCCCAAT ACGCAAACCG CCTCTCCCCG CGCGTTGGCC 
3 921 GATTCATTAA TGCAGCTGGC ACGACAGGTT TCCCGACTGG AAAGCGGGCA GTGAGCGCAA CGCAATTAAT 
3 9 91 GTGAGTTAGC TCACTCATTA GGCACCCCAG GCTTTACACT TTATGCTTCC GGCTCGTATG TTGTGTGGAA 
40 61 TTGTGAGCGG ATAACAATTT CACACAGGAA AC AG C T ATGA C CAT GAT T AC GCCAAGCGCG CAATTAACCC 

Kpnl Xhol 



4131 TCACTAAAGG GAACAAAAGC TGGGTACCGG GCCCCCCCTC GAGGTCATTC ATATGCTTGA GAAGAGAGTC 
4201 GGGATAGTCC AAAATAAAAC AAAGGTAAGA TTACCTGGTC AAAAGTGAAA ACATCAGTTA AAAGGTGGTA 
42 71 TAAGTAAAAT ATCGGTAATA AAAGGTGGCC CAAAGTGAAA TTTACTCTTT TCTACTATTA TAAAAATTGA 
4 341 GGATGTTTTG TCGGTACTTT GATACGTCAT TTTTGTATGA ATTGGTTTTT AAGTTTATTC GCGATTTGGA 
4411 AATGCATATC TGTATTTGAG TCGGTTTTTA AGTTCGTTGC TTTTGTAAAT ACAGAGGGAT TTGTATAAGA 

EcoRI 

44 81 AATATCTTTA AAAAACCCAT ATGCTAATTT GACATAATTT TTGAGAAAAA TATATATTCA GGCGAATTCC 
4551 ACAATGAACA ATAATAAGAT TAAAATAGCT TGCCCCCGTT GCAGCGATGG GTATTTTTTC TAGTAAAATA 
4621 AAAGAT AAA C TTAGACTCAA AACATTTACA AAAACAACCC CTAAAGTCCT AAAGCCCAAA GTGCTATGCA 
4691 CGATCCATAG CAAGCCCAGC CCAACCCAAC CCAACCCAAC CCACCCCAGT GCAGCCAACT GGCAAATAGT 
4 761 CTCCACCCCC GG CAC TAT C A CCGTGAGTTG TCCGCACCAC CGCACGTCTC GCAGCCAAAA AAAAAAAAAG 
4 831 AAAGAAAAAA AAGAAAAAGA AAAACAGCAG GTGGGTCCGG GTCGTGGGGG CCGGAAAAGC GAGGAGGATC 
4 901 GCGAGCAGCG ACGAGGCCCG GCCCTCCCTC CGCTTCCAAA GAAACGCCCC CCATCGCCAC TATATACATA 
4 971 CCCCCCCCTC TCCTCCCATC CCCCCAACCC TACCACCACC ACCACCACCA CCTCCTCCCC CCTCGCTGCC 
5041 GGACGACGAG CTCCTCCCCC CTCCCCCTCC GCCGCCGCCG GTAACCACCC CGCCCCTCTC CTCTTTCTTT 
5111 CTCCGTTTTT TTTTTCGTCT CGGTCTCGAT CTTTGGCCTT GGTAGTTTGG GTGGGCGAGA GCGGCTTCGT 

BamHI 

5181 CGCCCAGATC GGTGCGCGGG AGGGGCGGGA TCTCGCGGCT GGCGTCTCCG GGCGTGAGTC GGCCCGGATC 
BamHI Bglll 



5251 CTCGCGGGGA ATGGGGCTCT CGGATGTAGA TCTTCTTTCT TTCTTCTTTT TGTGGTAGAA TTTGAATCCC 
53 21 TCAGCATTGT TCATCGGTAG TTTTTCTTTT CATGATTTGT GACAAATGCA GCCTCGTGCG GAGCTTTTTT 
53 91 GTAGC 




Figure 3£ A 



Figure 



Ncol 

Hindi I I 



M AHA RVL LLAL A V L ATA AVAV 
AAGCTTACCA TGGCCCACGC CCGCGTCCTC CTCCTGGCGC TCGCCGTGCT GGCCACGGCC GCCGTCGCCG 

Not I 



. ASS SSF ADSN PIR PVT DRAA AST 
71 TCGCCTCCTC CTCCTCCTTC GCCGACTCCA ACCCGATCCG GCCCGTCACC GACCGCGCGG CCGCCTCCAC 
. Q G I SEDL YSR LVE MATI SQA AYA 
141 GCAGGGCATC TCCGAAGACC TCTACAGCCG TTTAGTCGAA ATGGCCACTA TCTCCCAAGC TGCCTACGCC 

Sail 



DLCN IPS Til KGEK IYN SQT DING 
211 GACCTGTGCA ACATTCCGTC GACTATTATC AAGGGAGAGA AAATTTACAA TTCTCAAACT GACATTAACG 
BamHI 

. WIL RDD SSKE I I T VFR GTGS D T N • 

2 81 GATGGATCCT CCGCGACGAC AGC AG CAAAG AAATAATCAC CGTCTTCCGT GGCACTGGTA GTGATAG GAA 

* L Q L DTNY TLT PFD TLPQ CNG CEV 

3 51 TCTACAACTC GATACTAACT ACACCCTCAC GCCTTTCGAC ACCCTACCAC AATGCAACGG TTGTGAAGTA 

HGGY YIG WVS VQDQ VES LVK QQVS 
421 CACGGTGGAT ATTATATTGG ATGGGTCTCC GTCCAGGACC AAGTCGAGTC GCTTGTCAAA CAGCAGGTTA 

- QYP DYA LTVT GHX LGA SLAA L T A * 

4 91 GCCAGTATCC GGACTACGCG CTGACCGTGA CCGGCCACKC CCTCGGCGCC TCCCTGGCGG CACTCACTGC 

• A Q L SATY DNI RLY TFGE PRS GNQ 

5 61 CGCCCAGCTG TCTGCGACAT ACGACAACAT CCGCCTGTAC ACCTTCGGCG AACCGCGCAG CGGCAATCAG 

Xhol 



AFAS YMN DAF QASS PDT TQY FRVT 
631 GCCTTCGCGT CGTACATGAA CGATGCCTTC CAAGCCTCGA GCCCAGATAC GACGCAGTAT TTCCGGGTCA 

Ncol 



• HAN DGI PNLP PVE QGY AHGG V E Y • 
701 CTCATGCCAA CGACGGCATC CCAAACCTGC CCCCGGTGGA GCAGGGGTAC GCCCATGGCG GTGTAGAGTA 

-WSV DPYS AQN TFV CTGD EVQ CCE 
771 CTGGAGCGTT GATCCTTACA GCGCCCAGAA CACATTTGTC TGCACTGGGG ATGAAGTGCA GTGCTGTGAG 

SphI 

AQGG QGV NNA HTTY FGM TSG ACTW 
841 GCCCAGGGCG GACAGGGTGT GAATAATGCG CACACGACTT ATTTTGGGAT GACGAGCGGC GCATGCACCT 
NotI 



• P V A A A * 

911 GGCCGGTCGC GGCCGCGTAA CCACTGAAGG ATGAGCTGTA AAGAAGCAGA TCGTTCAAAC ATTTGGCAAT 
981 AAAGTTTCTT AAGATTGAAT CCTGTTGCCG GTCTTGCGAT GATTATCATA TAATTTCTGT TGAATTACGT 
1051 TAAGCATGTA ATAATTAACA TGTAATGCAT GACGTTATTT ATGAGATGGG TTTTTATGAT TAGAGTCCCG 
1121 CAATTATACA TTTAATACGC GATAGAAAAC AAAAT AT AG C GCGCAAACTA GGATAAATTA TCGCGCGCGG 

Hindi I I 



Clal Xbal 



1191 TGTCATCTAT GTTACTAGAT CGATAAGCTT CTAGAGCGGG CGGTGGAGCT CCAATTCGCC CTATAGTGAG 

1261 TCGTATTACG CGCGCTCACT GGCCGTCGTT TTACAACGTC GTGACTGGGA AAACCCTGGC GTTACCCAAC 

1331 TTAATCGCCT TGCAGCACAT CCCCCTTTCG CCAGCTGGCG TAATAGCGAA GAGGCCCGCA CCGATCGCCC 

14 01 TTCCCAACAG TTGCGCAGCC TGAATGGCGA ATGGGACGCG CCCTGTAGCG GCGCATTAAG CGCGGCGGGT 

1471 GTGGTGGTTA CGCGCAGCGT GACCGCTACA CTTGCCAGCG CCCTAGCGCC CGCTCCTTTC GCTTTCTTCC 

1541 CTTCCTTTCT CGCCACGTTC GCCGGCTTTC CCCGTCAAGC TCTAAATCGG GGGCTCCCTT TAGGGTTCCG 

1611 ATTTAGTGCT TTACGGCACC TCGACCCCAA AAAACTTGAT TAGGGTGATG GTTCACGTAG TGGGCCATCG 



Figure 3C C 

1681 CCCTGATAGA CGGTTTTTCG CCCTTTGACG TTGGAGTCCA CGTTCTTTAA TAGTGGACTC TTGTTCCAAA 

1751 CTGGAACAAC ACTCAACCCT ATGTCGGTCT ATTCTTTTGA TTTATAAGGG ATTTTGCCGA TTTCGGCCTA 

1821 TTGGTTAAAA AATGAGCTGA TTTAACAAAA ATTTAACGCG AATTTTAACA AAATATTAAC GCTTACAATT 

1891 TAGGTGGCAC TTTTCGGGGA AATGTGCGCG GAACCCCTAT TTGTTTATTT TTCTAAATAC ATTCAAATAT 

1961 GTATCCGCTC ATGAGA CAAT AACCCTGATA AATGCTTCAA TAAT AT TGAA AAAGGAAGAG TATGAGTATT 

2 031 CAACATTTCC GTGTCGCCCT TATTCCCTTT TTTGCGGCAT TTTGCCTTCC TGTTTTTGCT CACCCAGAAA 

2101 CGCTGGTGAA AGTAAAAGAT GCTGAAGATC AGTTGGGTGC ACGAGTGGGT TACATCGAAC TGGATCTCAA 

2171 CAGCGGTAAG ATCCTTGAGA GTTTTCGCCC CGAAGAACGT TTTCCAATGA TGAGCACTTT TAAAGTTCTG 

2241 CTATGTGGCG CGGTATTATC CCGTATTGAC GCCGGGCAAG AGCAACTCGG TCGCCGCATA CACTATTCTC 

2311 AGAATGACTT GGTTGAGTAC TCACCAGTCA CAGAAAAGCA TCTTACGGAT GGCATGACAG TAAGAGAATT 

23 81 ATGCAGTGCT GCCATAACCA TGAGTGATAA CACTGCGGCC AACTTACTTC TGACAACGAT CGGAGGACCG 

24 51 AAGGAGCTAA CCGCTTTTTT GCACAACATG GGGGATCATG TAACTCGCCT TGATCGTTGG GAACCGGAGC 
2 521 TGAATGAAGC CATACCAAAC GACGAGCGTG ACACCACGAT GCCTGTAGCA ATGGCAACAA CGTTGCGCAA 
2 5 91 ACTATTAACT GGCGAACTAC TTACTCTAGC TTCCCGGCAA CAATTAATAG ACTGGATGGA GGCGGATAAA 
2 661 GTTGCAGGAC CACTTCTGCG CTCGGCCCTT CCGGCTGGCT GGTTTATTGC TGATAAATCT GGAGCCGGTG 
2731 AGCGTGGGTC TCGCGGTATC ATTGCAGCAC TGGGGCCAGA TGGTAAGCCC TCCCGTATCG TAGTTATCTA 
28 01 CACGACGGGG AGTCAGGCAA CTATGGATGA ACGAAATAGA CAGATCGCTG AGATAGGTGC CTCACTGATT 
2 8 71 AAGCATTGGT AACTGTCAGA CCAAGTTTAC TCATATATAC TTTAGATTGA TTTAAAACTT CAT TTT TAAT 

2 941 TTAAAAGGAT CTAGGTGAAG ATCCTTTTTG ATAATCTCAT GACCAAAATC CCTTAACGTG AGTTTTCGTT 

3 011 CCACTGAGCG TCAGACCCCG TAGAAAAGAT CAAAGGATCT TCTTGAGATC CTTTTTTTCT GCGCGTAATC 
3081 TGCTGCTTGC AAACAAAAAA ACCACCGCTA CCAGCGGTGG TTTGTTTGCC GGATCAAGAG CTACCAACTC 
3151 TTTTTCCGAA GGTAACTGGC TTCAGCAGAG CGCAGATACC AAATACTGT C CTTCTAGTGT AGC CGTAGTT 
3221 AGGCCACCAC TTCAAGAACT CTGTAGCACC GCCTACATAC CTCGCTCTGC TAATCCTGTT ACCAGTGGCT 
32 91 GCTGCCAGTG GCGATAAGTC GTGTCTTACC GGGTTGGACT CAAGACGATA GTTACCGGAT AAGGCGCAGC 
3361 GGTCGGGCTG AACGGGGGGT TCGTGCACAC AGCCCAGCTT GGAGCGAACG ACCTACACCG AACTGAGATA 
3 431 CCTACAGCGT GAGCTATGAG AAAGCGCCAC GCTTCCCGAA GGGAGAAAGG CGGACAGGTA TCCGGTAAGC 
3 5 01 GGCAGGGTCG GAACAGGAGA GCGCACGAGG GAGCTTCCAG GGGGAAACGC CTGGTATCTT TATAGTCCTG 
3 5 71 TCGGGTTTCG CCACCTCTGA CTTGAGCGTC GATTTTTGTG ATGCTCGTCA GGGGGGCGGA GCCTATGGAA 
3 641 AAACGCCAGC AACGCGGCCT TTTTACGGTT CCTGGCCTTT TGCTGGGCTT TTGCTCACAT GTTCTTTCCT 
3 711 GCGTTAT'CCC CTGATTCTGT GGATAACCGT ATTACCGCCT TTGAGTGAGC TGATACCGCT CGCCGCAGCC 
3 7 81 GAACGACCGA GCGCAGCGAG TCAGTGAGCG AGGAAGCGGA AGAGCGCCCA ATACGCAAAC CGCCTCTCCC 
3 8 51 CGCGCGTTGG CCGATTCATT AATGCAGCTG GCACGACAGG TTTCCCGACT GGAAAGCGGG CAGTGAGCGC 
3 921 AACGCAATTA ATGTGAGTTA GCTCACTCAT TAGGCACCCC AGGCTTTACA CTTTATGCTT CCGGCTCGTA 
'3991 TGTTGTGTGG AATTGTGAGC GGATAACAAT TTCACACAGG AAACAG C TAT GAC CATGATT ACGCCAAGCG 

Kpnl Xhol 



4 061 CGCAATTAAC CCTCACTAAA GGGAACAAAA GCTGGGTACC GGGCCCCCCC TCGAGGTCAT TCATATGCTT 

4131 GAGAAGAGAG TCGGGATAGT CCAAAATAAA ACAAAGGTAA GATTACCTGG TCAAAAGTGA AAACATCAGT 

42 01 TAAAAGGTGG TATAAGTAAA ATATCGGTAA TAAAAGGTGG CCCAAAGTGA AATTTACTCT TTTCTACTAT 

42 71 TATAAAAATT GAGGATGTTT TGTCGGTACT TTGATACGTC ATTTTTGTAT GAATTGGTTT TTAAGTTTAT 

4341 TCGCGATTTG GAAATGCATA TCTGTATTTG AGTCGGTTTT TAAGTTCGTT GCTTTTGTAA ATACAGAGGG 

4411 ATTTGTATAA GAAATATCTT TAAAAAACCC ATATGCTAAT TTGACATAAT TTTTGAGAAA AATATATATT 
EcoRI 



44 81 CAGGCGAATT CCACAATGAA CAATAATAAG ATTAAAATAG CTTGCCCCCG TTGCAGCGAT GGGTATTTTT 

4551 TCTAGTAAAA TAAAAGATAA ACTTAGACTC AAAACATTTA CAAAAACAAC CCCTAAAGTC CTAAAGCCCA 

4 621 AAGTGCTATG CACGATCCAT AGCAAGCCCA GCCCAACCCA ACCCAACCCA ACCCACCCCA GTGCAGCCAA 

46 91 CTGGCAAATA GTCTCCACCC CCGGCACTAT CACCGTGAGT TGTCCGCACC ACCGCACGTC TCGCAGCCAA 

4761 AAAAAAAAAA AGAAAGAAAA AAAAGAAAAA GAAAAACAGC AGGTGGGTCC GGGTCGTGGG GGCCGGAAAA 

4 831 GCGAGGAGGA TCGCGAGCAG CGACGAGGCC CGGCCCTCCC TCCGCTTCCA AAGAAACGCC CCCCATCGCC 

4 901 AC TATATACA TACCCCCCCC TCTCCTCCCA TCCCCCCAAC CCTACCACCA CCACCACCAC CACCTCCTCC 

4 971 CCCCTCGCTG CCGGACGACG AGCTCCTCCC CCCTCCCCCT CCGCCGCCGC CGGTAACCAC CCCGCCCCTC 

5 041 TCCTCTTTCT TTCTCCGTTT TTTTTTTCGT CTCGGTCTCG ATCTTTGGCC TTGGTAGTTT GGGTGGGCGA 
5111 GAGCGGCTTC GTCGCCCAGA TCGGTGCGCG GGAGGGGCGG GATCTCGCGG CTGGCGTCTC CGGGCGTGAG 

BamHI Bglll 



5181 TCGGCCCGGA TCCTCGCGGG GAATGGGGCT CTCGGATGTA GATCTTCTTT CTTTCTTCTT TTTGTGGTAG 
5251 AATTTGAATC CCTCAGCATT GTTCATCGGT AGTTTTTCTT TTCATGATTT GTGACAAATG CAGCCTCGTG 
53 21 CGGAGCTTTT TTGTAGC 
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Hindu I PstI 

M KQF S A K HVLA VVV TAG HAL 

AAGCTTAACA TGAAGCAGTT CTCCGCCAAA CACGTCCTCG CAGTTGTGGT GACTGCAGGG CACGCCTTAG 
• AST QGI SEDL YSR LVE MATI SQA 

CAGCCTCTAC GCAAGGCATC TCCGAAGACC TCTACAGCCG TTTAGTCGAA ATGGCCACTA TCTCCCAAGC 

Sail 



. A Y A DLCN IPS T I I KGEK IYN SQT 
TGCCTACGCC GACCTGTGCA ACATTCCGTC GACTATTATC AAGGGAGAGA AAATTTACAA TTCTCAAACT 
BamHI 



DING W I L RDD SSKE I I T VFR GTG 

S 

GACATTAACG GATGGATCCT CCGCGACGAC AGCAGCAAAG AAATAATCAC CGTCTTCCGT GGCACTGGTA 
•DTN LQL DTNY TLT PFD TLPQ CNG 



GTGATACGAA TCTACAACTC GATACTAACT ACACCCTCAC GCCTTTCGAC ACCCTACCAC AATGCAACGG 
-CEV HGGY YIG WVS VQDQ VES LVK 
3 51 TTGTGAAGTA CACGGTGGAT ATTATATTGG ATGGGTCTCC GTCCAGGACC AAGTCGAGTC GCTTGTCAAA 

qqVS QYP DYA LTVT GHX LGA SLA 

421 CAGCAGGTTA GCCAGTATCC GGACTACGCG CTGACCGTGA CCGGCCACKC CCTCGGCGCC TCCCTGGCGG 

-LTA AQL SATY DNI RLY TFGE PRS 

491 CACTCACTGC CGCCCAGCTG TCTGCGACAT ACGACAACAT CCGCCTGTAC ACCTTCGGCG AACCGCGCAG 

Xhol 

. G N Q AFAS YMN DAF QASS PDT TQY 
561 CGGCAATCAG GCCTTCGCGT CGTACATGAA CGATGCCTTC CAAGCCTCGA GCCCAGATAC GACGCAGTAT 

Ncol 

FRVT HAN DGI PNLP PVE QGY AHG 

G 

631 TTCCGGGTCA CTCATGCCAA CGACGGCATC CCAAACCTGC CCCCGGTGGA GCAGGGGTAC GCCCATGGCG 

V E Y WSV DPYS AQN TFV CTGD EVQ 

701 GTGTAGAGTA CTGGAGCGTT GATCCTTACA GCGCCCAGAA CACATTTGTC TGCACTGGGG ATGAAGTGCA 

•CCE AQGG QGV NNA HTTY FGM TSG 

GTGCTGTGAG GCCCAGGGCG GACAGGGTGT GAATAATGCG CACACGACTT ATTTTGGGAT GACGAGCGGC 
SphI NotI 



ACTW PVA A A E PLKD EL* 
841 GCATGCACCT GGCCGGTCGC GGCCGCGGAA CCACTGAAGG ATGAGCTGTA AAGAAGCAGA TCGTTCAAAC 

911 ATTTGGCAAT AAAGTTTCTT AAGATTGAAT CCTGTTGCCG GTCTTGCGAT GATTATCATA TAATTTCTGT 

9 81 TGAATTACGT TAAGCATGTA ATAATTAACA TGTAATGCAT GACGTTATTT AT GAGATGGG TTTTTATGAT 

10 51 TAGAGTCCCG CAATTATACA TTTAATACGC GATAGAAAAC AAAATATAGC GCGCAAACTA GGATAAATTA 

Hindi II 



Clal Xbal 



1121 TCGCGCGCGG TGTCATCTAT GTTACTAGAT CGATAAGCTT CTAGAGCGGC CGGTGGAGCT CCAATTCGCC 

1191 CTATAGTGAG TCGTATTACG CGCGCTCACT GGCCGTCGTT TTACAACGTC GTGACTGGGA AAACCCTGGC 

12 61 GTTACCCAAC TTAATCGCCT TGCAGCACAT CCCCCTTTCG CCAGCTGGCG TAATAGCGAA GAGGCCCGCA 

13 31 CCGATCGCCC TTCCCAACAG TTGCGCAGCC TGAATGGCGA ATGGGACGCG CCCTGTAGCG GCGCATTAAG 
1401 CGCGGCGGGT GTGGTGGTTA CGCGCAGCGT GACCGCTACA CTTGCCAGCG CCCTAGCGCC CGCTCCTTTC 
1471 GCTTTCTTCC CTTCCTTTCT CGCCACGTTC GCCGGCTTTC CCCGTCAAGC TCTAAATCGG GGGCTCCCTT 



Figure ?? C 



1541 TAGGGTTCCG ATTTAGTGCT TTACGGCACC TCGACCCCAA AAAACTTGAT TAGGGTGATG GTTCACGTAG 

1611 TGGGCCATCG CCCTGATAGA CGGTTTTTCG CCCTTTGACG TTGGAGTCCA CGTTCTTTAA TAGTGGACTC 

1681 TTGTTCCAAA CTGGAACAAC ACTCAACCCT ATCTCGGTCT ATTCTTTTGA TTTATAAGGG ATTTTGCCGA 

175 1 TTTCGGCCTA TTGGTTAAAA AATGAGCTGA TTTAACAAAA ATTTAACGCG AATTTTAACA AAATATTAAC 

1821 GCTTACAATT TAGGTGGCAC TTTTCGGGGA AATGTGCGCG GAACCCCTAT TTGTTTATTT TTCTAAATAC 

1891 ATTCAAATAT GTATCCGCTC ATGAGACAAT AACCCTGATA AATGCTTCAA TAATATTGAA AAAGGAAGAG 

1961 TATGAGTATT CAACATTTCC GTGTCGCCCT TATTCCCTTT TTTGCGGCAT TTTGCCTTCC TGTTTTTGCT 

2 031 CACCCAGAAA CGCTGGTGAA AGTAAAAGAT GCTGAAGATC AGTTGGGTGC ACGAGTGGGT TACATCGAAC 

2101 TGGATCTCAA CAGCGGTAAG ATCCTTGAGA GTTTTCGCCC CGAAGAACGT TTTCCAATGA TGAGCACTTT 

2171 TAAAGTTCTG CTATGTGGCG CGGTATTATC CCGTATTGAC GCCGGGCAAG AGCAACTCGG TCGCCGCATA 

2241 CACTATTCTC AGAATGACTT GGTTGAGTAC TCACCAGTCA CAGAAAAGCA TCTTACGGAT GGCATGACAG 

2311 TAAGAGAATT ATGCAGTGCT GCCATAACCA TGAGTGATAA CACTGCGGCC AACTTACTTC TGACAACGAT 

23 81 CGGAGGACCG AAGGAGC TAA CCGCTTTTTT GCACAACATG GGGGATCATG TAACTCGCCT TGATCGTTGG 

2451 GAACCGGAGC TGAATGAAGC CATACCAAAC GACGAGCGTG ACACCACGAT GCCTGTAGCA ATGGCAACAA 

2521 CGTTGCGCAA ACTATTAACT GGCGAACTAC TTACTCTAGC TTCCCGGCAA CAATTAATAG ACTGGATGGA 

25 91 GGCGGATAAA GTTGCAGGAC CACTTCTGCG CTCGGCCCTT CCGGCTGGCT GGTTTATTGC TGATAAAT C T 

2661 GGAGCCGGTG AGCGTGGGTC TCGCGGTATC ATTGCAGCAC TGGGGCCAGA TGGTAAGCCC TCCCGTATCG 

2 731 TAGTTATCTA CACGACGGGG AGTCAGGCAA CTATGGATGA ACGAAATAGA CAGATCGCTG AGATAGGTGC 

28 01 CTCACTGATT AAGCATTGGT AACTGTCAGA CCAAGTTTAC TCATATATAC TTTAGATTGA TTTAAAACTT 

28 71 CATTTTTAAT TTAAAAGGAT CTAGGTGAAG ATCCTTTTTG ATAATCTCAT GACCAAAATC CCTTAACGTG 

2 941 AGTTTTCGTT CCACTGAGCG TCAGACCCCG TAGAAAAGAT CAAAGGATCT TCTTGAGATC CTTTTTTTCT 
30H GCGCGTAATC TGCTGCTTGC AAACAAAAAA ACCACCGCTA CCAGCGGTGG TTTGTTTGCC GGATCAAGAG 
30 81 CTACCAACTC TTTTTCCGAA GGTAACTGGC TTCAGCAGAG CGCAGATACC AAATACTGTC CTTCTAGTGT 
3151 AGCCGTAGTT AGGCCACCAC TTCAAGAACT CTGTAGCACC GCCTACATAC CTCGCTCTGC TAATCCTGTT 
3221 ACCAGTGGCT GCTGCCAGTG GCGATAAGTC GTGTCTTACC GGGTTGGACT CAAGACGATA GTTACCGGAT 

32 91 AAGGCGCAGC GGTCGGGCTG AACGGGGGGT TCGTGCACAC AGCCCAGCTT GGAGCGAACG ACCTACACCG 

33 61 AACTGAGATA CCTACAGCGT GAGCTATGAG AAAGCGCCAC GCTTCCCGAA GGGAGAAAGG CGGACAGGTA 

34 31 TCCGGTAAGC GGCAGGGTCG GAACAGGAGA GCGCACGAGG GAGCTTCCAG GGGGAAACGC CTGGTATCTT 

35 01 TATAGTCCTG TCGGGTTTCG CCACCTCTGA CTTGAGCGTC GATTTTTGTG ATGCTCGTCA GGGGGGCGGA 

3 571 GCCTATGGAA AAACGCCAGC AACGCGGCCT TTTTACGGTT CCTGGCCTTT TGCTGGCCTT TTGCTCACAT 
3641 GTTCTTTCCT GCGTTATCCC CTGATTCTGT GGATAACCGT ATTACCGCCT TTGAGTGAGC TGATACCGCT 
3 711 CGCCGCAGCC GAACGACCGA GCGCAGCGAG TGAGTGAGCG AGGAAGCGGA AGAGCGCCCA AT AC GC AAAC 
3 781 CGCCTCTCCC CGCGCGTTGG CCGATTCATT AATGCAGCTG GCACGACAGG TTTCCCGACT GGAAAGCGGG 
3 851 CAGTGAGCGC AACGCAATTA ATGTGAGTTA GCTCACTCAT TAGGCACCCC AGGCTTTACA CTTTATGCTT 
3 921 CCGGCTCGTA TGTTGTGTGG AATTGTGAGC GGATAACAAT TTCACACAGG AAACAGCTAT GACCATGATT 

Kpnl Xhol 



3 991 ACGCCAAGCG CGCAATTAAC CCTCACTAAA GGGAACAAAA GCTGGGTACC GGGCCCCCCC TCGAGGTCAT 

4 0 61 TCATATGCTT GAGAAGAGAG TCGGGATAGT CCAAAATAAA AC AAAGG TAA GATTACCTGG TCAAAAGTGA 
4131 AAACATCAGT TAAAAGGTGG TATAAGTAAA ATATCGGTAA TAAAAGGTGG CCCAAAGTGA AATTTACTCT 
42 01 TTTCTACTAT TATAAAAATT GAGGATGTTT TGTCGGTACT TTGATACGTC ATTTTTGTAT GAATTGGTTT 
4271 TTAAGTTTAT TCGCGATTTG GAAATGCATA TCTGTATTTG AGTCGGTTTT TAAGTTCGTT GCTTTTGTAA 
4 341 ATACAGAGGG ATTTGTATAA GAAATATCTT TAAAAAACCC ATATGCTAAT TTGACATAAT TTTTGAGAAA 

EcoRI 



4411 AAT AT AT AT T CAGGCGAATT CCACAATGAA CAATAATAAG ATTAAAATAG CTTGCCCCCG TTGCAGCGAT 

44 81 GGGTATTTTT TCTAGTAAAA TAAAAGATAA ACTTAGACTC AAAACATTTA CAAAAACAAC CCCTAAAGTC 

4551 CTAAAGCCCA AAGTGCTATG CACGATCCAT AGCAAGCCCA GCCCAACCCA ACCCAACCCA ACCCACCCCA 

4621 GTGCAGCCAA CTGGCAAATA GTCTCCACCC CCGGCACTAT CACCGTGAGT TGTCCGCACC ACCGCACGTC 

4 6 91 TCGCAGCCAA AAAAAAAAAA AGAAAGAAAA AAAAGAAAAA GAAAAACAGC AGGTGGGTCC GGGTCGTGGG 

4 761 GGCCGGAAAA GCGAGGAGGA TCGCGAGCAG CGACGAGGCC CGGCCCTCCC TCCGCTTCCA AAGAAACGCC 

4 831 CCCCATCGCC ACTATATACA TACCCCCCCC TCTCCTCCCA TCCCCCCAAC CCTACCACCA CCACCACCAC 

4 901 CACCTCCTCC CCCCTCGCTG CCGGACGACG AGCTCCTCCC CCCTCCCCCT CCGCCGCCGC CGGTAACCAC 

4 971 CCCGCCCCTC TCCTCTTTCT TTCTCCGTTT TTTTTTTCGT CTCGGTCTCG ATCTTTGGCC TTGGTAGTTT 

5041 GGGTGGGCGA GAGCGGCTTC GTCGCCCAGA TCGGTGCGCG GGAGGGGCGG GATCTCGCGG CTGGCGTCTC 

BamHI Bglll 



5111 CGGGCGTGAG TCGGCCCGGA TCCTCGCGGG GAATGGGGCT CTCGGATGTA GATCTTCTTT CTTTCTTCTT 

5181 TTTGTGGTAG AATTTGAATC CCTCAGCATT GTTCATCGGT AGTTTTTCTT TTCATGATTT GTGACAAATG 

5251 CAGCCTCGTG CGGAGCTTTT TTGTAGC 



Figure 3£ A 




Figure 3g_B 



BamHI 



-ILR DDSS K E I I T V FRGT GSD TNL 
1 GATCCTCCGC GA C G AC AG C A GCAAAGAAAT AATCACCGTC TTCCGTGGCA CTGGTAGTGA TACGAATCTA 

Q L D T NYT LTP FDTL PQC NGC EVHG 
71 CAACTCGATA CTAACTACAC CCTCACGCCT TTCGACACCC TACCACAATG CAACGGTTGT GAAGTACACG 

• GYY I G W VSVQ DQV ESL VKQQ V S Q • 
141 GTGGATAT T A TATTGGATGG GTCTCCGTCC AGGACCAAGT CGAGTCGCTT GTCAAACAGC AGGTTAGCCA 

. Y P D YALT VTG HXL GASL AAL TAA 
211 GTATCCGGAC TACGCGCTGA CCGTGACCGG CCACKCCCTC GGGGCCTCCC TGGCGGCACT CACTGCCGCC 

QLSA TYD NIR LYTF GEP RSG NQAF 
2 81 CAGCTGTCTG CGACATACGA CAACATCCGC CTGTACACCT TCGGCGAACC GCGCAGCGGC AATCAGGCCT 

Xhol 



> A S Y MND AFQA SSP DTT QYFR VTH* 
3 51 TCGCGTCGTA CATGAACGAT GCCTTCCAAG CCTCGAGCCC AGATACGACG CAGTATTTCC GGGTCACTCA 

Ncol 



• AND GIPN LPP VEQ GYAH GGV EYW 
421 TGCCAACGAC GGCATCCCAA ACCTGCCCCC GGTGGAGCAG GGGTACGCCC ATGGCGGTGT AGAGTACTGG 

SVDP YSA QNT FVCT GDE VQC CEAQ 
4 91 AGCGTTGATC CTTACAGCGC CCAGAACACA TTTGTCTGCA CTGGGGATGA AGTGCAGTGC TGTGAGGCCC 

G G Q GVN NAHT TYF GMT SGAC T W * - 
561 AGGGCGGACA GGGTGTGAAT AATGCGCACA CGACTTATTT TGGGATGACG AGCGGAGCCT GTACATGGTG 

. * 

631 ATCAGTCATT TCAGCCTCCC CGAGTGTACC AGGAAAGATG GATGTCCTGG AGAGGGGGCC GCGTAACCAC 
701 TGAAGGATGA GCTGTAAAGA AGCAGAT CGT TCAAACATTT GGCAATAAAG TTTCTTAAGA TTGAATCCTG 

7 71 TTGCCGGTCT TGCGATGATT AT CATATAAT TTCTGTTGAA TTACGTTAAG CATGTAATAA TTAACATGTA 

841 ATGCATGACG TTATTTATGA GATGGGTTTT TATGATTAGA GTCCCGCAAT TATACATTTA ATACGCGATA 

Clal 

911 GAAAACAAAA TATAGCGCGC AAACTAGGAT AAATTATCGC GCGCGGTGTC ATCTATGTTA CTAGATCGAT 

Xbal 



Hindi I I 



981 AAGCTTCTAG AGCGGCCGGT GGAGCTCCAA TTCGCCCTAT AGTGAGTCGT ATTACGCGCG CTCACTGGCC 

1051 GTCGTTTTAC AACGTCGTGA CTGGGAAAAC CCTGGCGTTA CCCAACTTAA TCGCCTTGCA GCACATCCCC 

1121 CTTTCGCCAG CTGGCGTAAT AGCGAAGAGG CCCGCACCGA TCGCCCTTCC CAACAGTTGC GCAGCCTGAA 

1191 TGGCGAATGG GACGCGCCCT GTAGCGGCGC ATTAAGCGCG GCGGGTGTGG TGGTTACGCG CAGCGTGACC 

12 61 GCTACACTTG CCAGCGCCCT AGCGCCCGCT CCTTTCGCTT TCTTCCCTTC CTTTCTCGCC ACGTTCGCCG 

1331 GCTTTCCCCG TCAAGCTCTA AATCGGGGGC TCCCTTTAGG GTTCCGATTT AGTGCTTTAC GGCACCTCGA 

14 01 CCCCAAAAAA CTTGATTAGG GTGATGGTTC ACGTAGTGGG CCATCGCCCT GATAGACGGT TTTTCGCCCT 

14 71 TTGACGTTGG AGTCCACGTT CTTTAATAGT GGACTCTTGT TCCAAACTGG AACAACACTC AACCCTATCT 

1541 CGGTCTATTC TTTTGATTTA TAAGGGATTT TGCCGATTTC GGCCTATTGG TTAAAAAATG AGCTGATTTA 

1611 ACAAAAATTT AACGCGAATT TTAACAAAAT ATTAACGCTT ACAATTTAGG TGGCACTTTT CGGGGAAATG 

16 81 TGGGCGGAAC CCCTATTTGT TTATTTTTCT AAATACATTC AAATATGTAT CCGCTCATGA GACAATAACC 

17 51 CTGATAAATG CTTCAATAAT AT T GAAAAAG GAAGAGTATG AGTATTCAAC ATTTCCGTGT CGCCCTTATT 
1821 CCCTTTTTTG CGGCATTTTG CCTTCCTGTT TTTGCTCACC CAGAAACGCT GGTGAAAGTA AAAGATGCTG 

18 91 AAGATCAGTT GGGTGCACGA GTGGGTTACA TCGAACTGGA TCTCAACAGC GGTAAGATCC TTGAGAGTTT 
1961 TCGCCCCGAA GAACGTTTTC CAATGATGAG CACTTTTAAA GTTCTGCTAT GTGGCGCGGT AT TAT CC CGT 
2 031 ATTGACGCCG GGCAAGAGCA ACTCGGTCGC CGCATACACT ATTCTCAGAA TGACTTGGTT GAGTACTCAC 
2101 CAGTCACAGA AAAGCATCTT ACGGATGGCA TGACAGTAAG AGAATTATGC AGTGCTGCCA TAACCATGAG 
2171 T GAT AAC AC T GCGGCCAACT TACTTCTGAC AACGATCGGA GGACCGAAGG AGCTAACCGC TTTTTTGCAC 
2241 AACATGGGGG ATCATGTAAC TCGCCTTGAT CGTTGGGAAC CGGAGCTGAA TGAAGCCATA CCAAACGACG 
2 311 AGCGTGACAC CACGATGCCT GTAGCAATGG CAACAACGTT GCGCAAACTA TTAACTGGCG AACTACTTAC 
23 81 TCTAGCTTCC CGGCAACAAT TAATAGACTG GATGGAGGCG GATAAAGTTG CAGGACCACT TCTGCGCTCG 
2451 GCCCTTCCGG CTGGCTGGTT TATTGCTGAT AAATCTGGAG CCGGTGAGCG TGGGTCTCGC GGTATCATTG 
2521 CAGCACTGGG GCCAGATGGT AAGCCCTCCC GTATCGTAGT TATCTACACG ACGGGGAGTC AGGCAACTAT 
25 91 GGATGAACGA AATAGACAGA TCGCTGAGAT AGGTGCCTCA CTGATTAAGC ATTGGTAACT GTCAGACCAA 
2661 GTTTACTCAT ATATACTTTA GATTGATTTA AAACTTCATT TTTAATTTAA AAGGAT C TAG GTGAAGATCC 



Figure 3$ C 



2 731 TTTTTGATAA TCTCATGACC AAAATCCCTT AACGTGAGTT TTCGTTCCAC TGAGCGTCAG ACCCCGTAGA 

2 8 01 AAAGAT C AAA GGATCTTCTT GAGATCCTTT TTTTCTGCGC GTAATCTGCT GCTTGCAAAC AAAAAAACCA 

2 871 CCGCTACCAG CGGTGGTTTG TTTGCCGGAT CAAGAGCTAC CAACTCTTTT TCCGAAGGTA ACTGGCTTCA 

2 941 GCAGAGCGCA GATACCAAAT ACTGTCCTTC TAGTGTAGCC GTAGTTAGGC CACCACTTCA AGAACTCTGT 

3 011 AGCACCGCCT ACATACCTCG CTCTGCTAAT CCTGTTACCA GTGGCTGCTG CCAGTGGCGA TAAGTCGTGT 
3 0 81 CTTACCGGGT TGGACTCAAG ACGATAGTTA CCGGATAAGG CGCAGCGGTC GGGCTGAACG GGGGGTTCGT 
3151 GCACACAGCC CAGCTTGGAG CGAACGACCT ACACCGAACT GAG AT AC C T A CAGCGTGAGC TATGAGAAAG 
3221 CGCCACGCTT CCCGAAGGGA GAAAGGCGGA CAGGTATCCG GTAAGCGGCA GGGTCGGAAC AGGAGAGCGC 
32 91 ACGAGGGAGC TTCCAGGGGG AAACGCCTGG TATCTTTATA GTCCTGTCGG GTTTCGCCAC CTCTGACTTG 
3 3 61 AGCGTCGATT TTTGTGATGC TCGTCAGGGG GGCGGAGCCT ATGGAAAAAC GCCAGCAACG CGGCCTTTTT 
3431 ACGGTTCCTG GCCTTTTGCT GGCCTTTTGC TCACATGTTC TTTCCTGCGT TATCCCCTGA TTCTGTGGAT 
35 01 AACCGTATTA CCGCCTTTGA GTGAGCTGAT ACCGCTCGCC GCAGCCGAAC GACCGAGCGC AGCGAGTCAG 
3 5 71 TGAGCGAGGA AG C GGAAG AG CGCCCAATAC GCAAACCGCC TCTCCCCGCG CGTTGGCCGA TTCATTAATG 
3 641 CAGCTGGCAC GACAGGTTTC CCGACTGGAA AGCGGGCAGT GAGCGCAACG CAATTAATGT GAGTTAGCTC 
3 711 ACTCATTAGG CACCCCAGGC TTTACACTTT ATGCTTCCGG CTCGTATGTT GTGTGGAATT GTGAGCGGAT 
3 7 81 AACAATTTCA CACAGGAAAC AGCTATGACC ATGATTACGC CAAGCGCGCA ATTAACCCTC ACTAAAGGGA 

Kpnl Xhol 



3 851 ACAAAAGCTG GGTACCGGGC CCCCCCTCGA GGTCATTCAT ATGCTTGAGA AGAGAGTCGG GATAGTCCAA 
3 921 AATAAAACAA AGGTAAGATT ACCTGGTCAA AAGTGAAAAC ATCAGTTAAA AGGTGGTATA AGTAAAATAT 
3 9 91 CGGTAATAAA AGGTGGCCCA AAGTGAAATT TACTCTTTTC TACTATTATA AAAATTGAGG ATGTTTTGTC 
4061 GGTACTTTGA TACGTCATTT TTGTATGAAT TGGTTTTTAA GTTTATTCGC GATTTGGAAA TGCATATCTG 
4131 TATTTGAGTC GGTTTTTAAG TTCGTTGCTT TTGTAAATAC AGAGGGATTT GTATAAGAAA TATCTTTAAA 

EcoRI 



4201 AAACCCATAT GCTAATTTGA CATAATTTTT GAGAAAAATA TATATTCAGG CGAATTCCAC AATGAACAAT 

42 71 AATAAGATTA AAATAGCTTG CCCCCGTTGC AGCGATGGGT ATTTTTTCTA GTAAAATAAA AGATAAACTT 

4 341 AGACTCAAAA CATTTACAAA AACAACCCCT AAAGTCCTAA AGCCCAAAGT GCTATGCACG ATCCATAGCA 

4411 AGCCCAGCCC AACCCAACCC AACCCAACCC ACCCCAGTGC AGCCAACTGG CAAATAGTCT CCACCCCCGG 

44 81 CACTATCACC GTGAGTTGTG CGCACCACCG CACGTCTCGC AGCCAAAAAA AAAAAAAGAA AGAAAAAAAA 

4551 GAAAAAGAAA AACAGCAGGT GGGTCCGGGT CGTGGGGGCC GGAAAAGCGA GGAGGATCGC GAGCAGCGAC 

4 621 GAGGCCCGGC CCTCCCTCCG CTTCCAAAGA AACGCCCCCC ATCGCCACTA T AT AC AT AC C CCCCCCTCTC 

4 6 91 CTCCCATCCC CCCAACCCTA CCACCACCAC CACCACCACC TCCTCCCCCC TCGCTGCCGG ACGACGAGCT 

4 761 CCTCCCCCCT CCCCCTCCGC CGCCGCCGGT AACCACCCCG CCCCTCTCCT CTTTCTTTCT CCGTTTTTTT 

4 831 TTTCGTCTCG GTCTCGATCT TTGGCCTTGG TAGTTTGGGT GGGCGAGAGC GGCTTCGTCG CCCAGATCGG 

BamHI 



4 901 TGCGCGGGAG GGGCGGGATC TCGCGGCTGG CGTCTCCGGG CGTGAGTCGG CCCGGATCCT CGCGGGGAAT 

Bglll 



4 9 71 GGGGCTCTCG GATGTAGATC TTCTTTCTTT CTTCTTTTTG TGGTAGAATT TGAATCCCTC AGCATTGTTC 

Hindi II 

5 041 ATCGGTAGTT TTTCTTTTCA TGATTTGTGA CAAATGCAGC CTCGTGCGGA GCTTTTTTGT AGCAAGCTTA 

PStI 



MKQ FSA KHVL AVV VTA GHAL A A S • 
5111 ACATGAAGCA GTTCTCCGCC AAACACGTCC TCGCAGTTGT GGTGACTGCA GGGCACGCCT TAGCAGCCTC 

. T Q G I 'S E D LYS RLV EMAT ISQ AAY 
5181 TACGCAAGGC ATCTCCGAAG ACCTCTACAG CCGTTTAGTC GAAATGGCCA CTATCTCCCA AGCTGCCTAC 

Sail 



ADLC NIP STI IKGE KIY NSQ TDIN 
5251 GCCGACCTGT GCAACATTCC GTCGACTATT ATCAAGGGAG AGAAAATTTA CAATTCTCAA ACTGACATTA 

B 



• G W 
53 21 ACGGATG 



Figure 3J A 




amp 



Figure 3^ B 



Ncol 



Hindi I I 



M AHA RVL LLAL A V L ATA AVAV 
1 AAGCTTACCA TGGCCCACGC CCGCGTCCTC CTCCTGGCGC TCGCCGTGCT GGCCACGGCC GCCGTCGCCG 

NPIR NotI 



• ASS SSF ADSN PIR PVT DRAA AST 
71 TCGCCTCCTC CTCCTCCTTC GCCGACTCCA ACCCGATCCG GCCCGTCACC GACCGCGCGG CCGCCTCCAC 
•QGI SEDL YSR LVE MATI SQA AYA 
141 GCAGGGCATC TCCGAAGACC TCTACAGCCG TTTAGT CGAA ATGGCCACTA TCTCCCAAGC TGCCTACGCC 

Sail 



D L C N IPS Til KGEK IYN SQT DING 
211 GACCTGTGCA ACATTCCGTC GACTATTATC AAGGGAGAGA AAATTTAGAA TTCTCAAACT GACATTAACG 

BamHI 



• W I L RDD SSKE I I T VFR GTGS DTN< 
281 GATGGATCCT CCGCGACGAC AGCAGCAAAG AAATAAT C AC CGTCTTCCGT GGCACTGGTA GTGATACGAA 

Glycosylation 

•LQL DTNY TLT PFD TLPQ CNG CEV 
351 TCTACAACTC GATACTAACT ACACCCTCAC GCCTTTCGAC ACCCTACCAC AATGCAACGG TTGTGAAGTA 

HGGY YIG WVS VQDQ VES LVK QQVS 
421 CACGGTGGAT ATTATATTGG ATGGGTCTCC GTCCAGGACC AAGTCGAGTC GCTTGTCAAA CAGCAGGTTA 

• QYP DYA LTVT GHX LGA SLAA L T A * 
491 GCCAGTATCC GGACTACGCG CTGACCGTGA CCGGCCACKC CCTCGGCGCC TCCCTGGCGG CACTCACTGC 

• A Q L SATY DNI RLY TFGE PRS GNQ 
561 CGCCCAGCTG TCTGCGACAT AC GAC AAC AT CCGCCTGTAC ACCTTCGGCG AACCGCGCAG CGGCAATCAG 

Xhol 



AFAS YMN DAF QASS PDT TQY FRVT 
631 GCCTTCGCGT CGTACATGAA CGATGCCTTC CAAGCCTCGA GCCCAGATAC GACGCAGTAT TTCCGGGTCA 

Ncol 



• HAN DGI PNLP PVE QGY AHGG VEY 
701 CTCATGCCAA CGACGGCATC CCAAACCTGC CCCCGGTGGA GCAGGGGTAC GCCCATGGCG GTGTAGAGTA 

•WSV DPYS AQN TFV CTGD EVQ CCE 
771 CTGGAGCGTT GATCCTTACA GCGCCCAGAA CACATTTGTC TGCACTGGGG ATGAAGTGCA GTGCTGTGAG 

SphI 

AQGG QGV NNA HTTY FGM TSG ACTW 
841 GCCCAGGGCG GACAGGGTGT GAATAATGCG CACACGACTT ATTTTGGGAT GACGAGCGGC GCATGCACCT 



Figure ^ C 



Not I KDEL 



PVA A A E TTEG * 
911 GGCCGGTCGC GGCCGCGGAA ACCACTGAAG GATGAGCTGT AAAGAAGCAG ATCGTTCAAA CATTTGGCAA 
981 TAAAGTTTCT TAAGATTGAA TCCTGTTGCC GGTCTTGCGA TGATTATCAT ATAATTTCTG TTGAATTACG 
1051 TTAAGCATGT AATAATTAAC ATGTAATGCA TGACGTTATT TATGAGATGG GTTTTTATGA TTAGAGTCCC 
1121 GCAATTATAC ATTTAATACG CGATAGAAAA CAAAATATAG CGCGCAAACT AGGATAAATT ATCGCGCGCG 

Hindi I I 



Clal Xbal 



1191 GTGTCATCTA TGTTACTAGA TCGATAAGCT TCTAGAGCGG CCGGTGGAGC TCCAATTCGC CCTATAGTGA 

12 61 GTCGTATTAC GCGCGCTCAC TGGCCGTCGT TTTACAACGT CGTGACTGGG AAAACCCTGG CGTTACCCAA 

13 31 CTTAATCGCC TTGCAGCACA TCCCCCTTTC GCCAGCTGGC GTAATAGCGA AGAGGCCCGC ACCGATCGCC 
1401 CTTCCCAACA GTTGCGCAGC CTGAATGGCG AATGGGACGC GCCCTGTAGC GGCGCATTAA GCGCGGCGGG 

14 71 TGTGGTGGTT ACGCGCAGCG TGACCGCTAC ACTTGCCAGC GCCCTAGCGC CCGCTCCTTT CGCTTTCTTC 
1541 CCTTCCTTTC TCGCCACGTT CGCCGGCTTT CCCCGTCAAG CTCTAAATCG GGGGCTCCCT TTAGGGTTCC 
1611 GATTTAGTGC TTTACGGCAC CTCGACCCCA AAAAACTTGA TTAGGGTGAT GGTTCACGTA GTGGGCCATC 

16 81 GCCCTGATAG ACGGTTTTTC GCCCTTTGAC GTTGGAGTCC ACGTTCTTTA ATAGTGGACT CTTGTTCCAA 

17 51 ACTGGAACAA CACTCAACCC TATCTCGGTC TATTCTTTTG ATTTATAAGG GATTTTGCCG ATTTCGGCCT 
1821 ATTGGTTAAA AAATGAGCTG ATTTAACAAA AATTTAACGC GAATTTTAAC AAAATATTAA CGCTTACAAT 

18 91 TTAGGTGGCA CTTTTCGGGG AAATGTGCGC GGAACCCCTA TTTGTTTATT TTTCTAAATA CATTCAAATA 
1961 TGTATCCGCT CATGAGACAA TAACCCTGAT AAATGCTTCA ATAATATTGA AAAAGGAAGA GTATGAGTAT 
2 031 TCAACATTTC CGTGTCGCCC TTATTCCCTT TTTTGCGGCA TTTTGCCTTC CTGTTTTTGC TCACCCAGAA 
2101 ACGCTGGTGA AAGTAAAAGA TGCTGAAGAT CAGTTGGGTG CACGAGTGGG TTACATCGAA CTGGATCTCA 
2171 ACAGCGGTAA GATCCTTGAG AGTTTTCGCC CCGAAGAACG TTTTCCAATG ATGAGCACTT TTAAAGTTCT 
2241 GCTATGTGGC GCGGTATTAT CCCGTATTGA GGCCGGGCAA GAGCAACTCG GTCGCCGCAT ACACTATTCT 
2311 CAGAATGACT TGGTTGAGTA CTCACCAGTC ACAGAAAAGC ATCTTACGGA TGGCATGACA GTAAGAGAAT 

23 81 TATGCAGTGC TGCCATAACC ATGAGTGATA ACACTGCGGC CAACTTACTT CTGACAACGA TCGGAGGACC 

24 51 GAAGGAGCTA ACCGCTTTTT TGCACAACAT GGGGGATCAT GTAACTCGCC TTGATCGTTG GGAACCGGAG 
2521 CTGAATGAAG C CAT AC C AAA CGACGAGCGT GACACCACGA TGCCTGTAGC AATGGCAACA ACGTTGCGCA 

25 91 AACTATTAAC TGGCGAACTA CTTACTCTAG CTTCCCGGCA ACAATTAATA GACTGGATGG AGGCGGATAA 
2661 AGTTGCAGGA CCACTTCTGC GCTCGGCCCT TCCGGCTGGC TGGTTTATTG CTGATAAATC TGGAGCCGGT 
2 731 GAGCGTGGGT CTCGCGGTAT CATTGCAGCA CTGGGGCCAG ATGGTAAGCC CTCCCGTATC GTAGTTATCT 
2801 ACACGACGGG GAGTCAGGCA ACTATGGATG AACGAAATAG ACAGATCGCT GAGATAGGTG CCTCACTGAT 
2 8 71 TAAGCATTGG TAACTGTCAG ACCAAGTTTA CTCATATATA CTTTAGATTG ATTTAAAACT TCATTTTTAA 

2 941 TTTAAAAGGA TCTAGGTGAA GATCCTTTTT GATAATCTCA TGACCAAAAT CCCTTAACGT GAGTTTTCGT 

3 011 TCCACTGAGC GTCAGACCCC GTAGAAAAGA TCAAAGGATC TTCTTGAGAT CCTTTTTTTC TGCGCGTAAT 
3 081 CTGCTGCTTG CAAACAAAAA AACCACCGCT ACCAGCGGTG GTTTGTTTGC CGGATCAAGA GCTACCAACT 
3151 CTTTTTCCGA AGGTAACTGG CTTCAGCAGA GCGCAGATAC CAAATACTGT CCTTCTAGTG TAGCCGTAGT 
3221 TAGGCCACCA CTTCAAGAAC TCTGTAGCAC CGCCTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC 

32 91 TGCTGCCAGT GGCGATAAGT CGTGTCTTAC CGGGTTGGAC TCAAGACGAT AGTTACCGGA TAAGGCGCAG 

33 61 CGGTCGGGCT GAACGGGGGG TTCGTGCACA CAGCCCAGCT TGGAGCGAAC GACCTACACC GAACTGAGAT 
3431 ACCTACAGCG TGAGCTATGA GAAAGCGCCA CGCTTCCCGA AGGGAGAAAG GCGGACAGGT ATCCGGTAAG 
3 501 CGGCAGGGTC GGAACAGGAG AGCGCACGAG GGAGCTTCCA GGGGGAAACG CCTGGTATCT TTATAGTCCT 
3 5 71 GTCGGGTTTC GCCACCTCTG ACTTGAGCGT CGATTTTTGT GATGCTCGTC AGGGGGGCGG AGCCTATGGA 
3 641 AAAACGCCAG CAACGCGGCC TTTTTACGGT TCCTGGCCTT TTGCTGGCCT TTTGCTCACA TGTTCTTTCC 
3 711 TGCGTTATCC CCTGATTCTG TGGATAACCG TATTACCGCC TTTGAGTGAG CTGATACCGC TCGCCGCAGC 
3 781 CGAACGACCG AGCGCAGCGA GTCAGTGAGC GAGGAAGCGG AAGAGCGCCC AATACGCAAA CCGCCTCTCC 
3851 CCGCGCGTTG GCCGATTCAT TAATGCAGCT GGCACGACAG GTTTCCCGAC TGGAAAGCGG GCAGTGAGCG 
3 921 CAACGCAATT AATGTGAGTT AGCTCACTCA TTAGGCACCC CAGGCTTTAC ACTTTATGCT TCCGGCTCGT 



Figure D 



3 991 ATGTTGTGTG GAATTGTGAG CGGATAACAA 



4 061 GCGCAATTAA CCCTCACTAA AGGGAACAAA 

4131 TGAGAAGAGA GTCGGGATAG TCCAAAATAA 

42 01 TTAAAAGGTG GTATAAGTAA AATATCGGTA 

4271 TTATAAAAAT TGAGGATGTT TTGTCGGTAC 

4341 TTCGCGATTT GGAAATGCAT ATCTGTATTT 

4411 GATTTGTATA AGAAATATCT TTAAAAAACC 
EcoRI 



44 81 TCAGGCGAAT TCCACAATGA ACAATAATAA 

4551 TTCTAGTAAA ATAAAAGATA AACTTAGACT 

4621 AAAGTGCTAT GCACGATCCA TAGCAAGCCC 

46 91 ACTGGCAAAT AGTCTCCACC CCCGGCACTA 

4761 AAAAAAAAAA AAGAAAGAAA AAAAAGAAAA 

4 831 AGCGAGGAGG ATCGCGAGCA GCGACGAGGC 

4 901 CACTATATAC ATACCCCCCC CTCTCCTCCC 

4 971 CCCCCTCGCT GCCGGACGAC GAGCTCCTCC 

5041 CTCCTCTTTC TTTCTCCGTT TTTTTTTTCG 

5111 AGAGCGGCTT CGTCGCCCAG ATCGGTGCGC 
BamHI 



5181 GTCGGCCCGG ATCCTCGCGG GGAATGGGGC 
5251 GAATTTGAAT CCCTCAGCAT TGTTCATCGG 
53 21 GCGGAGCTTT TTTGTAGC 



TTTCACACAG GAAACAGCTA TGACCATGAT TACGCCAAGC 
Kpnl Xhol 



AGCTGGGTAC CGGGCCCCCC CTCGAGGTCA TTCATATGCT 
AACAAAGGTA AGATTACCTG GTCAAAAGTG AAAACATCAG 
ATAAAAGGTG GCCCAAAGTG AAATTTACTC TTTTCTACTA 
TTTGATACGT CATTTTTGTA TGAATTGGTT TTTAAGTTTA 
GAGTCGGTTT TTAAGTTCGT TGCTTTTGTA AATACAGAGG 
CATATGCTAA TTTGACATAA TTTTTGAGAA AAATATATAT 



GATTAAAATA GCTTGCCCCC GTTGCAGCGA TGGGTATTTT 
CAAAACATTT ACAAAAACAA CCCCTAAAGT CCTAAAGCCC 
AGCCCAACCC AACCCAACCC AACCCACCCC AGTGCAGCCA 
TCACCGTGAG TTGTCCGCAC CACCGCACGT CTCGCAGCCA 
AGAAAAACAG CAGGTGGGTC CGGGTCGTGG GGGCCGGAAA 
CCGGCCCTCC CTCCGCTTCC AAAGAAACGC CCCCCATCGC 
ATCCCCCCAA CCCTAGCACC ACCACCACCA CCACCTCCTC 
CCCCTCCCCC TCCGCCGCCG CCGGTAACCA CCCCGCCCCT 
TCTCGGTCTC GATCTTTGGC CTTGGTAGTT TGGGTGGGCG 
GGGAGGGGCG GGATCTCGCG GCTGGCGTCT CCGGGCGTGA 
Bglll 



TCTCGGATGT AGATCTTCTT TCTTTCTTCT TTTTGTGGTA 
TAGTTTTTCT TTTCATGATT TGTGACAAAT GCAGCCTCGT 
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Hindlll (2) 
Ncol (9) 
ATG 

aleurain_sig nal/N PIR 
NPIR 



Rice_actin 
EcoRI (4487) 



Aval (4111) 



fvull (3878) 




4- 



pTU5 

5337 bp 




Aval (103) 

Smal(105) 
NotI (129) 
BamHI (285) 
\ Glycosylation 
lntron_1 

kJ FAE 

if 

PvuII (568) 
\ Aval (667) 
Ncol (754) 
NotI (921) 
Redundancy (stop) 
KDEL 
nos 

Clal (1211) 
Hindlll (1216) 
Xbal (1221) 
PvuII (1365) 
f1_ori 
ApaLI (2138) 



amp 



ence for pTU5 



Hindi I I Ncol 



1 AAGCTTACCA TGGCCCACGC CCGCGTCCTC CTCCTGGCGC TCGCGGTGCT 
TTCGAATGGT ACCGGGTGCG GGCGCAGGAG GAGGACCGCG AGCGGCACGA 

51 GGCCACGGCC GCCGTCGCCG TCGCCTCCTC CTCCTCCTTC GCCGACTCCA 
CCGGTGCCGG CGGCAGCGGC AGCGGAGGAG GAGGAGGAAG CGGCTGAGGT 

Smal 



Aval Not I 



101 AGCCGGGCCG GCCCGTCACC GACCGCGCGG CCGCCTCCAC GCAGGGCATC 
TGGGCCCGGC CGGGCAGTGG CTGGCGCGCC GGCGGAGGTG CGTCCCGTAG 

151 TCCGAAGACC TCTACAGCCG TTTAGTCGAA ATGGCCACTA TCTCCCAAGC 
AGGCTTCTGG AGATGTCGGC AAATCAGCTT TACCGGTGAT AGAGGGTTCG 

201 TGCCTACGCC GACCTGTGCA ACATTCCGTC GAGTATTATC AAGGGAGAGA 
ACGGATGCGG CTGGACACGT TGTAAGGCAG CTGATAATAG TTCCCTCTCT 

BamHI 



251 AAATTTACAA TTCTCAAAGT GACATTAAGG GATGGATCCT CCGCGACGAC 
TTTAAATGTT AAGAGTTTGA CTGTAATTGC CTACCTAGGA GGCGCTGCTG 

301 AGCAGCAAAG AAATAAT C AC CGTCTTCCGT GGCACTGGTA GTGATACGAA 
TCGTCGTTTC TTTATTAGTG GCAGAAGGCA CCGTGACCAT CACTATGCTT 

3 51 TCTACAACTC GATACTAACT ACACCCTCAC GCCTTTCGAC ACCCTACCAC 
AGATGTTGAG CTATGATTGA TGTGGGAGTG CGGAAAGCTG TGGGATGGTG 

401 AATGCAACGG TTGTGAAGTA CACGGTGGAT ATTATATTGG ATGGGTCTCC 
TTACGTTGCC AACACTTCAT GTGCCACCTA TAATATAACC TACCCAGAGG 

451 GTCGAGGACC AAGTCGAGTC GCTTGTCAAA CAGCAGGTTA GCCAGTATCC 
CAGGTCCTGG TTCAGCTGAG CGAAGAGTTT GTCGTCCAAT CGGTCATAGG 

501 GGACTACGCG CTGACCGTGA CCGGCCACKC CCTCGGCGCC TCCCTGGCGG 
CCTGATGCGC GACTGGCACT GGCCGGTGMG GGAGCCGCGG AGGGACCGCC 

PvuII 



551 CACTCACTGC CGCCCAGCTG TCTGCGACAT ACGACAACAT CCGCCTGTAC 
GTGAGTGACG GCGGGTCGAC AGACGCTGTA TGCTGTTGTA GGCGGACATG 

601 ACCTTCGGCG AACCGCGCAG CGGCAATCAG GCCTTCGCGT CGTACATGAA 
TGGAAGCCGC TTGGCGCGTC GCCGTTAGTC CGGAAGCGCA GCATGTACTT 

Aval 



651 CGATGCCTTC CAAGCCTCGA GCCCAGATAC GACGCAGTAT TTCCGGGTCA 
GCTACGGAAG GTTCGGAGCT CGGGTCTATG CTGCGTCATA AAGGCCCAGT 



7 01 CTCATGCCAA CGACGGCATC CCAAACCTGC CCCCGGTGGA GCAGGGGTAC 
GAGTACGGTT GCTGCCGTAG GGTTTGGACG GGGGCCACCT CGTCCCCATG 



751 GCCCATGGCG GTGTAGAGTA CTGGAGCGTT GATCCTTACA GCGCCCAGAA 
CGGGTACCGC CACATCTCAT GACCTCGCAA CTAGGAATGT CGCGGGTCTT 

801 CACATTTGTC TGCACTGGGG ATGAAGTGCA GTGCTGTGAG GCCCAGGGCG 
GTGTAAACAG ACGTGACCCC TACTTCACGT CACGACACTC CGGGTCCCGC 

851 GACAGGGTGT GAATAATGCG CACACGACTT ATTTTGGGAT GACGAGCGGC 
CTGTCCCACA CTTATTACGC GTGTGCTGAA TAAAAC C CTA CTGCTCGCCG 



901 GCATGCACCT GGCCGGTCGC GGCCGCGGAA CCACTGAAGG ATGAGCTGTA 
CGTACGTGGA CCGGCCAGCG CCGGCGCCTT GGTGACTTGC TACTCGACAT 

951 AAGAAGCAGA TCGTTCAAAC ATTTGGCAAT AAAGTTTCTT AAGATTGAAT 
TTCTTCGTCT AGCAAGTTTG TAAACCGTTA TTT CAAAGAA TTCTAACTTA 

10 01 CCTGTTGCCG GTCTTGCGAT GATTATCATA TAATTTCTGT TGAATTACGT 
GGACAACGGC CAGAACGCTA CTAATAGTAT ATTAAAGACA ACTTAATGCA 

10 51 TAAGCATGTA ATAATTAACA TGTAATGCAT GACGTTATTT ATGAGATGGG 
ATTCGTACAT TATTAATTGT ACATTACGTA CTGCAATAAA TACTCTACCC 

1101 TTTTTATGAT TAGAGTCCCG CAATTATACA TTTAATACGC GATAGAAAAC 
AAAAAT AC TA ATCTCAGGGC GTTAATATGT AAATTATGCG CTATCTTTTG 

1151 AAAATATAGC GCGCAAACTA GGATAAATTA TCGCGCGCGG TGTGATCTAT 
TTTTATATCG CGCGTTTGAT CCTATTTAAT AGCGCGCGCC ACAGTAGATA 



12 01 GTTACTAGAT CGATAAGCTT CTAGAGCGGC CGGTGGAGCT CCAATTCGCC 

CAATGATCTA GCTATTCGAA GATCTCGCCG GCCACCTCGA GGTTAAGCGG 

1251 CTATAGTGAG TCGTATTACG CGCGCTCACT GGCCGTCGTT TTACAACGTC 
GATATCACTC AGCATAATGC GCGCGAGTGA CCGGCAGCAA AATGTTGCAG 

13 01 GTGACTGGGA AAACCCTGGC GTTACCCAAC TTAATCGCCT TGCAGCACAT 

CACTGACCCT TTTGGGACCG CAATGGGTTG AATTAGCGGA ACGTCGTGTA 



1351 CCCCCTTTCG CCAGCTGGCG TAATAGCGAA GAGGCCCGCA CCGATCGCCC 



Ncol 



Not I 



xbal 



Clal Hindi I I 



PvuII 




G 



GGGGGAAAGC GGTCGACCGC ATTATCGCTT CTCCGGGCGT GGCTAGCGGG 

1401 TTCCCAACAG TTGCGGAGCC TGAATGGCGA ATGGGACGCG CCCTGTAGCG 
AAGGGTTGTC AACGCGTCGG AGTTACCGCT TACCCTGCGC GGGACATCGC 

1451 GCGCATTAAG CGCGGCGGGT GTGGTGGTTA CGCGCAGCGT GACCGCTACA 
CGCGTAATTC GCGCCGCCCA CACCACCAAT GCGCGTCGCA CTGGCGATGT 

1501 CTTGCCAGCG CCCTAGCGCC CGCTCCTTTC GCTTTCTTCC CTTCCTTTCT 
GAACGGTCGC GGGATCGCGG GCGAGGAAAG CGAAAGAAGG GAAGGAAAGA 

1551 CGCCACGTTC GCCGGCTTTC CCCGTCAAGC TCTAAATCGG GGGCTCCCTT 
GCGGTGCAAG CGGCCGAAAG GGGCAGTTCG AGATTTAGCC CCCGAGGGAA 

1601 TAGGGTTCCG ATTTAGTGCT TTACGGCACC TCGACCCCAA AAAACTTGAT 
ATCCCAAGGC TAAATCACGA AATGCCGTGG AGCTGGGGTT TTTTGAACTA 

1651 TAGGGTGATG GTTCACGTAG TGGGCCATCG CCCTGATAGA CGGTTTTTCG 
ATCCCACTAC CAAGTGCATC ACCCGGTAGC GGGACTATCT GCCAAAAAGC 

17 01 CCCTTTGACG TTGGAGTCCA CGTTCTTTAA TAGTGGACTC TTGTTCCAAA 

GGGAAACTGC AACCTCAGGT GCAAGAAATT ATCACCTGAG AACAAGGTTT 

1751 CTGGAACAAC ACTCAAGCCT ATGTCGGTCT ATTCTTTTGA TTTATAAGGG 
GACGTTGTTG TGAGTTGGGA TAGAGC CAGA TAAGAAAACT AAATATTCCC 

18 01 ATTTTGCCGA TTTCGGCCTA TTGGTTAAAA AATGAGCTGA TTTAACAAAA 

TAAAACGGCT AAAGC CGGAT AACCAATTTT TTACTCGACT AAATTGTTTT 

1851 ATTTAACGCG AATTTTAACA AAATATTAAC GCTTACAATT TAGGTGGCAC 
TAAATTGCGC TTAAAATTGT TTTATAATTG CGAATGTTAA ATCCACCGTG 

1901 TTTTCGGGGA AATGTGCGCG GAACCCCTAT TTGTTTATTT TTCTAAATAC 
AAAAGCCCCT TTACACGCGC CTTGGGGATA AACAAATAAA AAGATTTATG 

1951 ATTCAAATAT GTATCCGCTC ATGAGACAAT AACCCTGATA AATGCTTCAA 
TAAGTTTATA CATAGGCGAG TACTCTGTTA TTGGGACTAT TTACGAAGTT 

2 001 TAATATTGAA AAAGGAAGAG TATGAGTATT CAACATTTCC GTGTCGCCCT 
ATTATAACTT TTTCCTTCTG AT AC T CAT AA GTTGTAAAGG CACAGCGGGA 

2 051 TATTCCCTTT TTTGCGGCAT TTTGCCTTCC TGTTTTTGCT CACCCAGAAA 
ATAAGGGAAA AAACGCCGTA AAACGGAAGG ACAAAAACGA GTGGGTCTTT 



2101 CGCTGGTGAA AGTAAAAGAT GCTGAAGATC AGTTGGGTGC ACGAGTGGGT 
GCGACCACTT TCATTTTCTA CGACTTCTAG TCAACCCACG TGCTCACCCA 

2151 TACATCGAAC TGGATCTCAA CAGCGGTAAG ATCCTTGAGA GTTTTCGCCC 
ATGTAGCTTG ACCTAGAGTT GTCGCCATTC TAGGAACTCT CAAAAGCGGG 

22 01 CGAAGAACGT TTTCCAATGA TGAGCACTTT TAAAGTTCTG CTATGTGGCG 
GCTTCTTGCA AAAGGTTACT ACTCGTGAAA ATTTCAAGAC GATACACCGC 



ApaLI 




2 251 CGGTATTATC CCGTATTGAC GCCGGGCAAG AGCAACTCGG TCGCCGCATA 
GCCATAATAG GGCATAAGTG CGGCCCGTTC TCGTTGAGCC AGCGGCGTAT 

23 01 CACTATTCTC AGAATGACTT GGTTGAGTAC TCACCAGTCA CAGAAAAGCA 
GTGATAAGAG TCTTACTGAA CCAACTCATG AGTGGTCAGT GTCTTTTCGT 

2 3 51 TCTTACGGAT GGCATGACAG TAAGAGAATT ATGCAGTGCT GCCATAACCA 
AGAATGCCTA CCGTACTGTC ATTCTCTTAA TACGTCACGA CGGTATTGGT 

2401 TGAGTGATAA CACTGCGGCC AACTTACTTC TGACAACGAT CGGAGGACCG 
ACTGACTATT GTGACGCCGG TTGAATGAAG ACTGTTGCTA GCCTCCTGGC 

2451 AAGGAG CTAA CCGCTTTTTT GCACAACATG GGGGATCATG TAACTCGCCT 
TTCCTCGATT GGCGAAAAAA CGTGTTGTAC CCCCTAGTAC ATTGAGCGGA 

25 01 TGATCGTTGG GAACCGGAGC TGAATGAAGC CATACCAAAC GACGAGCGTG 
ACTAGCAACC CTTGGCCTCG ACTTACTTCG GTATGGTTTG CTGCTCGCAC 

2551 ACACCACGAT GCCTGTAGCA ATGGCAACAA CGTTGCGCAA ACTATTAACT 
TGTGGTGCTA CGGACATCGT TACCGTTGTT GCAACGCGTT TGATAATTGA 

2 6 01 GGCGAACTAC TTACTCTAGC TTCCCGGCAA CAATTAATAG ACTGGATGGA 
CCGCTTGATG AATGAGATCG AAGGGCCGTT GTTAATTATC TGACCTACCT 

2651 GGCGGATAAA GTTGCAGGAC CACTTCTGCG CTCGGCCCTT CCGGCTGGCT 
CCGCCTATTT CAACGTCCTG GTGAAGACGC GAGCCGGGAA GGCCGACCGA 

27 01 GGTTTATTGC TGATAAATCT GGAGCCGGTG AGCGTGGGTC TCGCGGTATC 

CCAAATAACG ACTATTTAGA CCTCGGCCAC TCGCACCCAG AG C GC CAT AG 

2 7 51 ATTGCAGCAC TGGGGC CAGA TGGTAAGCCC TCCCGTATCG TAGTTATCTA 
TAACGTCGTG ACCCCGGTCT ACCATTCGGG AGGGCATAGC ATCAATAGAT 

28 01 CACGACGGGG AGTCAGGCAA CTATGGATGA ACGAAATAGA CAGATCGCTG 

GTGCTGCCCC TCAGTCCGTT GATACCTACT TGCTTTATCT GTCTAGCGAC 

2 851 AGATAGGTGC CTCACTGATT AAGCATTGGT AACTGT CAGA CCAAGTTTAC 
TCTATCCACG GAGTGACTAA TTCGTAACCA TTGACAGTCT GGTTCAAATG 

2 901 T C ATAT AT AC TTTAGATTGA TTTAAAACTT CATTTTTAAT TTAAAAGGAT 
AGTATATATG AAATCTAACT AAATTTTGAA GTAAAAATTA AATTTTCCTA 

2 951 CTAGGTGAAG ATCCTTTTTG ATAATCTCAT GACCAAAATC CCTTAACGTG 

GATCCACTTC TAGGAAAAAC TATTAGAGTA CTGGTTTTAG GGAATTGCAC 

3 0 01 AGTTTTCGTT CCACTGAGCG TCAGACCCCG TAGAAAAGAT CAAAGGATCT 

TCAAAAGCAA GGTGACTCGC AGTCTGGGGC ATCTTTTCTA GTTTCCTAGA 

3 051 TCTTGAGATC CTTTTTTTCT GCGCGTAATC TGCTGCTTGC AAACAAAAAA 
AGAACTCTAG GAAAAAAAGA CGCGCATTAG ACGACGAACG TTTGTTTTTT 

3101 ACCACCGCTA CCAGCGGTGG TTTGTTTGCC GGATCAAGAG CTACCAACTC 
TGGTGGCGAT GGTCGCCACC AAACAAACGG CCTAGTTCTC GATGGTTGAG 




3151 TTTTTCCGAA GGTAACTGGC TTCAGCAGAG CGCAGATACC AAATACTGTC 
AAAAAGGCTT CCATTGACCG AAGTCGTCTC GCGTCTATGG TTTATGACAG 

32 01 CTTCTAGTGT AGCCGTAGTT AGGCCACGAC TTCAAGAACT CTGTAGCACC 
GAAGATCACA TCGGCATCAA TCCGGTGGTG AAGTTCTTGA GACATCGTGG 

32 51 GCCTACATAC CTCGCTCTGC TAAT C CTGTT ACCAGTGGCT GCTGCCAGTG 

CGGATGTATG GAGCGAGACG ATTAGGACAA TGGTCACCGA CGACGGTCAC 

33 01 GCGATAAGTC GTGTCTTACC GGGTTGGACT CAAGACGATA GTTACCGGAT 

CGCTATTCAG CACAGAATGG CCCAACCTGA GTTCTGCTAT CAATGGCCTA 



3 3 51 AAGGCGCAGC GGTCGGGCTG AACGGGGGGT TCGTGCACAC AGCCCAGCTT 
TTCCGCGTCG CCAGCCCGAC TTGCCCCCCA AGCACGTGTG TCGGGTCGAA 

34 01 GGAGCGAACG ACCTACACCG AACTGAGATA CCTACAGCGT GAGCTATGAG 
CCTCGCTTGC TGGATGTGGC TTGACTCTAT GGATGTCGCA CTCGATACTC 

34 51 AAAGCGCCAC GCTTCCCGAA GGGAGAAAGG CGGACAGGTA TCCGGTAAGC 
TTTCGCGGTG CGAAGGGCTT CCCTCTTTCC GCCTGTCCAT AGGCCATTCG 

3 501 GGCAGGGTCG GAACAGGAGA GCGCACGAGG GAGCTTCCAG GGGGAAACGC 
CCGTCCCAGC CTTGTCCTCT CGCGTGCTCC CTCGAAGGTC CCCCTTTGCG 

3 551 CTGGTATCTT TATAGTCCTG TCGGGTTTCG CCACCTCTGA CTTGAGCGTC 
GAC CAT AGAA AT AT C AGGAC AGCCCAAAGC GGTGGAGACT GAACTCGCAG 

3 6 01 GATTTTTGTG ATGCTCGTCA GGGGGGCGGA GCCTATGGAA AAACGCCAGC 
CTAAAAACAC TACGAGCAGT CCCCCCGCCT CGGATACCTT TTTGCGGTCG 

3 651 AACGCGGCCT TTTTACGGTT CCTGGCCTTT TGCTGGCCTT TTGCTCACAT 
TTGCGCGGGA AAAATGCCAA GGAC CGGAAA ACGACCGGAA AACGAGTGTA 

3 701 GTTCTTTCCT GCGTTATCCC CTGATTCTGT GGATAACCGT ATTACCGCCT 
CAAGAAAGGA CGCAATAGGG GAC TAAGACA CCTATTGGCA TAATGGCGGA 

3 751 TTGAGTGAGC TGATACCGCT CGCCGCAGCC GAACGACCGA GCGCAGCGAG 
AACTCACTCG ACTATGGCGA GCGGCGTCGG CTTGCTGGCT CGCGTCGCTC 

3 801 TCAGTGAGCG AGGAAGCGGA AGAGCGCCCA ATACGCAAAC CGCCTCTCCC 
AGTCACTCGC TCCTTCGCCT TCTCGCGGGT TATGCGTTTG GCGGAGAGGG 



3 851 CGCGCGTTGG CCGATTCATT AATGCAGCTG GCACGACAGG TTTCCCGACT 
GCGCGCAACC GGCTAAGTAA TTACGTCGAC CGTGCTGTCC AAAGGGCTGA 

3 901 GGAAAGCGGG CAGTGAGCGC AACGCAATTA ATGTGAGTTA GCTCACTCAT 
CCTTTCGCCC GTCACTCGCG TTGCGTTAAT TACACTCAAT CGAGTGAGTA 

3 951 TAGGCACCCC AGGCTTTACA CTTTATGCTT CCGGCTCGTA TGTTGTGTGG 



ApaLI 



PvuII 




ATCCGTGGGG TCCGAAATGT GAAATACGAA GGCCGAGCAT ACAACACACC 

4 0 01 AATTGTGAGC GGATAACAAT TTCAGACAGG AAACAGCTAT GACCATGATT 
TTAACACTCG CCTATTGTTA AAGTGTGTCC TTTGTCGATA CTGGTACTAA 

4051 ACGCCAAGCG CGCAATTAAC CCTCACTAAA GGGAACAAAA GCTGGGTACC 
TGGGGTTCGC GCGTTAATTG GGAGTGATTT CCCTTGTTTT CGACCCATGG 



4101 GGGCCCCCCC TCGAGGTCAT TCATATGCTT GAGAAGAGAG TCGGGATAGT 
CCCGGGGGGG AGCTCCAGTA AGTATACGAA CTCTTCTCTC AGCCCTATCA 

4151 CCAAAATAAA ACAAAGGTAA GATTACCTGG TCAAAAGTGA AAACATCAGT 
GGTTTTATTT TGTTTCCATT CTAATGGACC AGTTTTCACT TTTGTAGTCA 

42 01 TAAAAGGTGG TATAAGTAAA ATATGGGTAA TAAAAGGTGG CCCAAAGTGA 
ATTTTCCACC ATATTCATTT TATAGCCATT ATTTTCCACC GGGTTTCACT 

42 51 AATTTACTCT TTTCTACTAT TATAAAAATT GAGGATGTTT TGTCGGTACT 

TTAAATGAGA AAAGATGATA ATATTTTTAA CTCCTACAAA ACAGCCATGA 

43 01 TTGATACGTC ATTTTTGTAT GAATTGGTTT TTAAGTTTAT TCGCGATTTG 

AACTATGCAG TAAAAACATA CTTAACCAAA AATTCAAATA AGCGCTAAAC 

43 51 GAAATG CATA TCTGTATTTG AGTCGGTTTT TAAGTTCGTT GCTTTTGTAA 

CTTTACGTAT AGACATAAAC TCAGCCAAAA ATTCAAGCAA CGAAAACATT 

44 01 ATAGAGAGGG ATTTGTATAA GAAATATCTT TAAAAAACCC ATATGCTAAT 

TATGTCTCCC T AAA CAT AT T CTTTATAGAA ATTTTTTGGG TATACGATTA 



44 51 TTGACATAAT TTTTGAGAAA AATATATATT CAGGCGAATT CCACAATGAA 

AACTGTATTA AAAACTCTTT TTATATATAA GTCCGCTTAA GGTGTTACTT 

45 01 CAATAATAAG ATTAAAATAG CTTGCCCCCG TTGCAGCGAT GGGTATTTTT 

GTTATTATTC TAATTTTATC GAACGGGGGC AACGTCGCTA CCCATAAAAA 

4551 TCTAGTAAAA TAAAAGATAA ACTTAGACTC AAAACATTTA CAAAAACAAC 
AGATCATTTT ATTTTCTATT TGAATCTGAG TTTTGTAAAT GTTTTTGTTG 

46 01 CCCTAAAGTC CTAAAGCCCA AAGTGCTATG CACGATCCAT AGCAAGCGCA 

GGGATTTCAG GATTTCGGGT TTCACGATAC GTGCTAGGTA TCGTTCGGGT 

46 51 GCCCAACCCA ACCCAACCCA ACCCACCCCA GTGCAGCCAA CTGGCAAATA 

CGGGTTGGGT TGGGTTGGGT TGGGTGGGGT CACGTCGGTT GACCGTTTAT 

4701 GTCTCCACCC CCGGCACTAT CACCGTGAGT TGTCCGCACC ACCGCACGTC 
CAGAGGTGGG GGCCGTGATA GTGGCACTCA ACAGGCGTGG TGGCGTGCAG 

47 51 TCGCAGCCAA AAAAAAAAAA AGAAAGAAAA AAAAGAAAAA GAAAAACAGC 

AGCGTCGGTT TTTTTTTTTT TCTTTCTTTT TTTTCTTTTT CTTTTTGTCG 



Aval 



EcoRI 




48 01 AGGTGGGTCC GGGTCGTGGG GGCCGGAAAA GCGAGGAGGA TCGCGAGCAG 
TCCACCCAGG CCCAGCACCC CCGGCCTTTT CGCTCCTCGT AGCGCTCGTC 

4 851 CGACGAGGCC CGGCCCTCCC TCCGCTTCCA AAGAAACGCC CCCCATCGCC 
GCTGCTCGGG GCCGGGAGGG AGGCGAAGGT TTCTTTGCGG GGGGTAGCGG 

4901 ACTATATACA TACCCCCCCC TCTCCTCCCA TCCCCCCAAC CCTACCACGA 
TGATATATGT ATGGGGGGGG AGAGGAGGGT AGGGGGGTTG GGATGGTGGT 

4 951 CCACCACCAC CACCTCCTCC CCCCTCGCTG CCGGACGACG AGCTCCTCCC 
GGTGGTGGTG GTGGAGGAGG GGGGAGCGAC GGCCTGCTGC TCGAGGAGGG 

5001 CCCTCCCCCT CCGCCGCCGC CGGTAACCAC CCCGCCCCTC TCCTCTTTCT 
GGGAGGGGGA GGCGGCGGCG GCCATTGGTG GGGCGGGGAG AGGAGAAAGA 

50 51 TTCTCCGTTT TTTTTTTCGT CTCGGTCTCG ATCTTTGGCC TTGGTAGTTT 
AAGAGGCAAA AAAAAAAGCA GAGCCAGAGC TAGAAACCGG AAC CATC AAA 

5101 GGGTGGGCGA GAGCGGCTTC GTCGCCCAGA TCGGTGCGCG GGAGGGGCGG 
CCCACCCGCT CTCGCCGAAG CAGCGGGTCT AGCCACGCGC CCTCCCCGCC 



5151 GATCTCGCGG CTGGCGTCTC CGGGCGTGAG TCGGCCCGGA TCCTCGCGGG 
CTAGAGCGCC GACCGCAGAG GCCCGCACTC AGCCGGGCCT AGGAGCGCCC 

52 01 GAATGGGGCT CTCGGATGTA GATCTTCTTT CTTTCTTCTT TTTGTGGTAG 
CTTACCCCGA GAGCCTACAT C T AG AAGAAA GAAAGAAGAA AAACACCATC 

52 51 AATTTGAATC CCTCAGCATT GTTCATCGGT AGTTTTTCTT TTCATGATTT 

TTAAACTTAG GGAGTCGTAA CAAGTAGCCA TCAAAAAGAA AAGTAC T AAA 

53 01 GTGACAAATG CAGCCTCGTG CGGAGCTTTT TTGTAGC 

CACTGTTTAC GTCGGAGCAC GCCTCGAAAA AACATCG 



BamHI 




Bglll (4654) 
Bam HI (4622) 
Rice actin 



Hin dill {2) 



E co RI (3920) 
ssmi (3883) 



on 



Nco I (9) 




pGT6 

4773 bp 




sGFP 





M>fl(733) 
5wal(741) 
P^I(749) 
Nos? 

Bss HII (984) 
555 HII (1008) 
C/a I (1035) 
tfmdIII (1040) 
Xfcal (1045) 
Bss HII (1095) 
f1 ori 



amp 



Sequence for pGT6 



Hindlll Ncol 



1 AAGCTTACCA TGGTGAGCAA GGGCGAGGAG CTGTTCACCG GGGTGGTGCC 
CATCGTGGTC GAGCTGGACG 

TTCGAATGGT ACCACTCGTT CCGGCTCCTC GACAAGTGGC CCC&CCACGG 
GTAGGACCAG CTCGACCTGC 

71 GCGACGTGAA CGGCCACAAG TTCAGCGTGT CCGGCGAGGG CGAGGGCGAT 
GCCACCTACG GCAAGCTGAC 

CGCTGCACTT GCCGGTGTTC AAGTCGCACA GGCCGCTCCC GCTCCGGCTA 
CGGTGGATGC CGTTCGACTG 

141 CCTGAAGTTC ATCTGCACCA CCGGCAAGCT GCCCGTGCCC TGGCCCACCC 
TCGTGACCAC CTTCACGTAC 

GGACTTCAAG TAGACGTGGT GGCCGTTCGA CGGGCACGGG ACCGGGTGGG 
AGCACTGGTG GAAGTGGATG 

211 GGCGTGCAGT GCTTCAGCCG CTACCCCGAC CACATGAAGC AGCACGACTT 
CTTCAAGTCC GCCATGCCCG 

CCGCACGTCA CGAAGTCGGC GATGGGGCTG GTGTACTTCG TCGTGCTGAA 
GAAGTTCAGG CGGTACGGGC 

2 81 AAGGCTACGT CCAGGAGCGC ACCATCTTCT TCAAGGACGA CGGCAACTAC 
AAGACCCGCG CCGAGGTGAA 

TTCCGATGCA GGTCCTCGCG TGGTAGAAGA AGTTCCTGCT GCCGTTGATG 
TTCTGGGCGC GGCTCCACTT 

3 51 GTTCGAGGGC GACACCCTGG TGAACCGCAT CGAGCTGAAG GGCATCGACT 
TCAAGGAGGA CGGCAACATC 

CAAGCTCCCG CTGTGGGACC ACTTGGCGTA GCTCGACTTC CCGTAGCTGA 
AGTTCCTCCT GCCGTTGTAG 

421 CTGGGGCACA AGCTGGAGTA CAACTACAAC AGCCACAACG TCTATATCAT 
GGCCGACAAG CAGAAGAACG 

GACCCCGTGT TCGACCTCAT GTTGATGTTG TCGGTGTTGG AGATATAGTA 
CCGGCTGTTC GTCTTCTTGC 

491 GCATCAAGGT GAACTTCAAG ATCCGCCACA ACATCGAGGA CGGCAGCGTG 
CAGCTCGCCG ACCACTACCA 

CGTAGTTCCA CTTGAAGTTC TAGGCGGTGT TGTAGCTCCT GCCGTCGCAC 
GTCGAGCGGC TGGTGATGGT 

5 61 GCAGAACACC CCCATCGGCG ACGGCCCCGT GCTGCTGCCC GACAACCACT 
ACCTGAGCAC CCAGTCCGCC 

CGTCTTGTGG GGGTAGCCGC TGCCGGGGCA CGACGACGGG CTGTTGGTGA 
TGGACTCGTG GGTCAGGCGG 

631 CTGAGCAAAG ACCCCAACGA GAAGCGCGAT CACATGGTCC TGCTGGAGTT 
CGTGACCGCC GCCGGGATCA 

GACTCGTTTC TGGGGTTGCT CTTCGCGCTA GTGTACCAGG ACGACCTCAA 
GCACTGGCGG CGGCCCTAGT 




Smal 



NotI PstI 

701 CTCACGGCAT GGACGAGCTG TACAAGTAAA GCGGCCGCCC GGGCTGCAGG 
GAAACCACTG AAGGATGAGC 

GAGTGCCGTA CCTGCTCGAC ATGTTCATTT CGCCGGCGGG CCCGACGTCC 

CTTTGGTGAC TTCCTACTCG 

771 TGTAAAGAAG CAGATCGTTC AAACATTTGG CAATAAAGTT TCTTAAGATT 
GAATCCTGTT GCCGGTCTTG 

ACATTTCTTC GTCTAGCAAG TTTGTAAACC GTTATTTCAA AGAATTCTAA 

CTTAGGACAA CGGCCAGAAC 

841 CGATGATTAT CATATAATTT CTGTTGAATT ACGTTAAGCA TGTAATAATT 
AACATGTAAT GCATGACGTT 

GCTACTAATA GTATATTAAA GACAACTTAA TGCAATTCGT ACATTATTAA 

TTGTACATTA CGTACTGCAA 

911 ATTTATGAGA TGGGTTTTTA TGATTAGAGT CCCGCAATTA TACATTTAAT 

ACGCGATAGA AAACAAAATA 

TAAATACTCT AC C C AAAAAT ACTAATCTCA GGGCGTTAAT ATGTAAATTA 

TGCGCTATCT TTTGTTTTAT 



Xbal 



BSSHII BssHII 

Clal Hindi I I 



981 TAGCGCGCAA AC TAGGAT AA ATTATCGCGC GCGGTGTCAT CTATGTTACT 
AGAT CGATAA GCTTCTAGAG 

ATCGCGCGTT TGATCCTATT TAATAGCGCG CGCCACAGTA GATACAATGA 

TCTAGCTATT CGAAGATCTC 

BssHII 

1051 CGGCCGGTGG AGCTCCAATT CGCCCTATAG TGAGTCGTAT TACGCGCGCT 
CACTGGCCGT CGTTTTACAA 

GCCGGCCACC TCGAGGTTAA GCGGGATATC ACTCAGCATA ATGCGCGCGA 

GTGACCGGCA GCAAAATGTT 

1121 CGTCGTGACT GGGAAAACCC TGGCGTTACC CAACTTAATC GCCTTGCAGC 
ACATCCCCCT TTCGCCAGCT 

GCAGCACTGA CCCTTTTGGG ACCGCAATGG GTTGAATTAG CGGAACGTCG 
TGTAGGGGGA AAGCGGTCGA 

1191 GGCGTAATAG CGAAGAGGCC CGCACCGATC GCCCTTCCCA ACAGTTGCGC 
AGCCTGAATG GCGAATGGGA 

CCGCATTATC GCTTCTCCGG GCGTGGCTAG CGGGAAGGGT TGTCAACGCG 

TCGGACTTAC CGCTTACCCT 

o 



1261 CGCGCCCTGT AGCGGCGCAT TAAGCGCGGC GGGTGTGGTG GTTACGCGCA 
GCGTGACCGC TACACTTGCC 

GCGCGGGACA TCGCCGCGTA ATTCGCGCCG CCCACACCAC CAATGCGCGT 
CGCACTGGCG ATGTGAACGG 

13 31 AGCGCCCTAG CGCCCGCTCC TTTCGCTTTC TTCCCTTCCT TTCTCGCCAC 
GTTCGCCGGC TTTCCCCGTC 

TCGCGGGATC GCGGGCGAGG AAAGCGAAAG AAGGGAAGGA AAGAGCGGTG 
CAAGCGGCCG AAAGGGGCAG 

1401 AAGGTCTAAA TCGGGGGCTC CCTTTAGGGT TCCGATTTAG TGCTTTACGG 
CACCTCGACC CCAAAAAACT 

TTCGAGATTT AGCCCCCGAG GGAAATCCCA AGGCTAAATC ACGAAATGCC 
GTGGAGCTGG GGTTTTTTGA 

14 71 TGATTAGGGT GATGGTTCAC GTAGTGGGCC ATCGCCCTGA TAGACGGTTT 
TTCGCCCTTT GACGTTGGAG 

ACTAATCCCA CTACCAAGTG CATCACCCGG TAGCGGGACT ATCTGCCAAA 
AAGCGGGAAA CTGCAACCTC 

1541 TCCACGTTCT TTAATAGTGG ACTCTTGTTC CAAACTGGAA CAACACTCAA 
CCCTATCTCG GTCTATTCTT 

AGGTGCAAGA AATTATCACC TGAGAACAAG GTTTGACCTT GTTGTGAGTT 
GGGATAGAGC CAGATAAGAA 

1611 TTGATTTATA AGGGATTTTG CCGATTTCGG CCTATTGGTT AAAAAATGAG 
CTGATTTAAC AAAAATTTAA 

AACTAAATAT TCCCTAAAAC GGCTAAAGCC GGATAACCAA TTTTTTACTC 
GACTAAATTG TTTTTAAATT 

16 81 CGCGAATTTT AACAAAATAT TAACGCTTAC AATTTAGGTG GCACTTTTCG 
GGGAAATGTG CGCGGAACCC 

GCGCTTAAAA TTGTTTTATA ATTGCGAATG TTAAATCCAC CGTGAAAAGC 
CCCTTTACAC GCGCCTTGGG 

1751 CTATTTGTTT ATTTTTCTAA ATACATTCAA ATATGTATCC GCTCATGAGA 
CAATAACCCT GATAAATGCT 

GATAAACAAA TAAAAAGATT TATGTAAGTT TATACATAGG CGAGTACTCT 
GTTATTGGGA CTATTTACGA 

1821 TCAATAATAT TGAAAAAGGA AGAGTATGAG TATTCAACAT TTCCGTGTCG 
CCCTTATTCC CTTTTTTGCG 

AGTTATTATA ACTTTTTCCT TCTCATACTC ATAAGTTGTA AAGGCACAGC 
GGGAATAAGG GAAAAAACGC 

18 91 GCATTTTGCC TTCCTGTTTT TGCTCACCCA GAAACGCTGG TGAAAGTAAA 
AGATGCTGAA GATCAGTTGG 

CGTAAAAGGG AAGGACAAAA ACGAGTGGGT CTTTGCGACC ACTTTCATTT 

TCTACGACTT CTAGTCAACC 

19 61 GTGCACGAGT GGGTTACATC GAACTGGATC TCAACAGCGG TAAGATCCTT 
GAGAGTTTTC GCCCCGAAGA 



CACGTGCTCA CCCAATGTAG CTTGACCTAG AGTTGTCGCC ATTCTAGGAA 
CTCTCAAAAG CGGGGCTTCT 



2 031 ACGTTTTCCA ATGATGAGCA CTTTTAAAGT TCTGGTATGT GGCGCGGTAT 
TATCCCGTAT TGACGCCGGG 

TGCAAAAGGT TACTACTCGT GAAAATTTCA AGACGATACA CCGCGCCATA 

AT AGGG CAT A ACTGCGGCCC 

2101 CAAGAGCAAC TCGGTCGCCG CATACACTAT TCTCAGAATG ACTTGGTTGA 
GTACTCACCA GTCACAGAAA 

GTTCTCGTTG AGCCAGCGGC GTATGTGATA AGAGTCTTAC TGAACCAACT 
CATGAGTGGT CAGTGTCTTT 

2171 AGCATCTTAC GGATGGCATG ACAGTAAGAG AATTATGCAG TGCTGCCATA 
ACCATGAGTG ATAACACTGC 

TCGTAGAATG CCTACCGTAC TGTCATTCTC TTAATACGTC ACGACGGTAT 
TGGTACTCAC TATTGTGACG 

2241 GGCCAACTTA CTTCTGACAA CGATCGGAGG ACCGAAGGAG CTAACCGCTT 
TTTTGCACAA CATGGGGGAT 

CCGGTTGAAT GAAGACTGTT GCTAGCCTCC TGGCTTCCTC GATTGGCGAA 
AAAACGTGTT GTACCCCCTA 

2311 CATGTAACTC GCCTTGATCG TTGGGAACCG GAGCTGAATG AAGCCATACC 
AAACGAC GAG CGTGACACCA 

GTACATTGAG CGGAACTAGC AACCCTTGGC CTGGACTTAC TTCGGTATGG 
TTTGCTGCTC GCACTGTGGT 

23 81 CGATGCCTGT AGCAATGGCA ACAACGTTGC GCAAACTATT AACTGGCGAA 
CTACTTACTC TAGCTTCCCG 

GCTACGGACA TCGTTACCGT TGTTGCAACG CGTTTGATAA TTGACCGCTT 

GATGAATGAG ATCGAAGGGC 

2451 GCAACAATTA ATAGACTGGA TGGAGGCGGA TAAAGTTGCA GGACCACTTC 
TGCGCTCGGC CCTTCCGGCT 

CGTTGTTAAT TATCTGACCT ACCTCCGCCT ATTTCAACGT CCTGGTGAAG 
ACGCGAGCCG GGAAGGCCGA 

2521 GGCTGGTTTA TTGCTGATAA ATCTGGAGCC GGTGAGCGTG GGTCTCGCGG 
TATCATTGCA GCACTGGGGC 

CCGACCAAAT AACGACTATT TAGACCTCGG CCACTCGCAC CCAGAGCGCC 
ATAGTAACGT CGTGACCCCG 

2591 CAGATGGTAA GCCCTCCCGT ATCGTAGTTA TCTACACGAC GGGGAGTCAG 
GCAACTATGG ATGAACGAAA 

GTCTACCATT CGGGAGGGCA TAGCATCAAT AGATGTGCTG CCCCTCAGTC 
CGTTGATACC TACTTGCTTT 

2 661 TAGACAGATC GCTGAGATAG GTGCCTCACT GATTAAGCAT TGGTAACTGT 
CAGAC CAAGT TTACTCATAT 

ATCTGTCTAG CGACTCTATC CACGGAGTGA CTAATTCGTA ACCATTGACA 
GTCTGGTTCA AATGAGTATA 




2731 ATACTTTAGA TTGATTTAAA ACTTCATTTT TAATTTAAAA GGATCTAGGT 
GAAGATCCTT TTTGATAATC 

TATGAAATCT AACTAAATTT TGAAGTAAAA ATTAAATTTT CCTAGATCCA 
CTTCTAGGAA AAACTATTAG 

2 801 TCATGACCAA AATCCCTTAA CGTGAGTTTT CGTTCCACTG AGCGTCAGAC 
CCCGTAGAAA AGAT CAAAGG 

AGTACTGGTT TTAGGGAATT GCACTCAAAA GCAAGGTGAC TCGCAGTCTG 
GGGCATCTTT TCTAGTTTCC 

2871 ATCTTCTTGA GATCCTTTTT TTCTGCGCGT AATCTGCTGC TTGCAAACAA 
AAAAACCACC GCTACCAGCG 

TAGAAGAACT CTAGGAAAAA AAGACGCGCA TTAGACGACG AACGTTTGTT 
TTTTTGGTGG CGATGGTCGC 

2 941 GTGGTTTGTT TGCCGGATCA AGAGCTACCA ACTCTTTTTC CGAAGGTAAC 
TGGCTTCAGC AGAGGGCAGA 

CACCAAACAA ACGGCCTAGT TCTCGATGGT TGAGAAAAAG GCTTCCATTG 
ACCGAAGTCG TCTCGCGTCT 

3 011 T AC C AAATAC TGTCCTTCTA GTGTAGCCGT AGTTAGGCCA CCACTTCAAG 
AACTCTGTAG CACCGCCTAC 

ATGGTTTATG ACAGGAAGAT CACATCGGCA TCAATCCGGT GGTGAAGTTC 
TTGAGACATC GTGGCGGATG 

3 081 ATACCTCGCT CTGCTAATCC TGTTACCAGT GGCTGCTGCC AGTGGCGATA 
AGTCGTGTCT TACCGGGTTG 

TATGGAGCGA GACGATTAGG ACAATGGTCA CCGACGACGG TCACCGCTAT 
TCAGCACAGA ATGGCCCAAC 

3151 GACTCAAGAC GATAGTTACC GGATAAGGCG CAGCGGTCGG GCTGAACGGG 
GGGTTCGTGC ACAGAGCCCA 

CTGAGTTCTG CTATCAATGG CCTATTCCGC GTCGCCAGCC CGACTTGCCC 
CCCAAGCACG TGTGTCGGGT 

3221 GCTTGGAGCG AACGACCTAC ACCGAACTGA GATACCTACA GCGTGAGCTA 
TGAGAAAGCG CCACGCTTCC 

CGAACCTCGC TTGCTGGATG TGGCTTGACT CTATGGATGT CGCACTCGAT 
ACTCTTTCGC GGTGCGAAGG 

3291 CGAAGGGAGA AAGGCGGACA GGTATCCGGT AAGCGGCAGG GTCGGAACAG 
GAGAGCGCAC GAGGGAGCTT 

GCTTCCCTCT TTCCGCCTGT CCATAGGCCA TTCGCCGTCC CAGCCTTGTC 
CTCTCGCGTG CTCCCTCGAA 

33 61 CCAGGGGGAA ACGCCTGGTA TCTTTATAGT CCTGTCGGGT TTCGCCACCT 
CTGACTTGAG CGTCGATTTT 

GGTCCCCCTT TGCGGAC CAT AGAAATAT C A GGACAGCCCA AAGCGGTGGA 
GACTGAACTC GCAGCTAAAA 

3431 TGTGATGCTC GTCAGGGGGG CGGAGCCTAT GGAAAAACGC CAGCAACGCG 
GCCTTTTTAC GGTTCCTGGC 

ACACTACGAG CAGTCCCCCC GCCTCGGATA CCTTTTTGCG GTCGTTGCGC 
CGGAAAAATG CCAAGGACCG 




3501 CTTTTGCTGG CCTTTTGCTC ACATGTTCTT TCCTGCGTTA TCCCCTGATT 
CTGTGGATAA CCGTATTACC 

GAAAACGACC GGAAAACGAG TGTACAAGAA AGGACGCAAT AGGGGAC TAA 
GACACCTATT GGCATAATGG 

3 571 GCCTTTGAGT GAGCTGATAC CGCTCGCCGC AGCCGAACGA CCGAGCGCAG 
CGAGTCAGTG AGCGAGGAAG 

CGGAAACTCA CTCGACTATG GCGAGCGGCG TCGGCTTGCT GGCTCGCGTC 
GCTCAGTCAC TCGCTCCTTG 

3 641 CGGAAGAGCG CCCAATACGC AAACCGCCTC TCCCCGCGCG TTGGCCGATT 
CATTAATGCA GCTGGCACGA 

GCCTTCTCGC GGGTTATGCG TTTGGCGGAG AGGGGCGCGC AACCGGCTAA 
GTAATTACGT CGACCGTGCT 

3 711 CAGGTTTCCC GACTGGAAAG CGGGCAGTGA GCGCAACGCA ATTAATGTGA 
GTTAGCTCAG TCATTAGGCA 

GTCCAAAGGG CTGACCTTTC GCCCGTCACT CGCGTTGCGT TAATTACACT 
CAATCGAGTG AGTAATC CGT 

3 781 CCCCAGGCTT TACACTTTAT GCTTCCGGCT GGTATGTTGT GTGGAATTGT 
GAGCGGATAA CAATTTCACA 

GGGGTCCGAA ATGTGAAATA CGAAGGCCGA G C AT AC AAC A CACCTTAACA 
CTCGCCTATT GTTAAAGTGT 



3 851 CAGGAAACAG C TATGAC CAT GATTACGCCA AGCGCGCAAT TAACCCTCAC 
TAAAGGGAAC AAAAGCTGGA 

GTCCTTTGTC GATACTGGTA CTAATGCGGT TCGCGCGTTA ATTGGGAGTG 
ATTTCCCTTG TTTTCGACCT 



3 921 ATTC CACAAT GAACAATAAT AAGATTAAAA TAGCTTGCCC CCGTTGCAGC 
GATGGGTATT TTTTCTAGTA 

TAAGGTGTTA CTTGTTATTA TTCTAATTTT ATCGAACGGG GGCAACGTCG 
CTACCCATAA AAAAGAT CAT 

3 991 AAATAAAAGA TAAACTTAGA CTCAAAACAT TTACAAAAAC AACCCCTAAA 
GTCCTAAAGC CCAAAGTGCT 

TTTATTTTCT ATTTGAATCT GAGTTTTGTA AATGTTTTTG TTGGGGATTT 
CAGGATTTCG GGTTTCACGA 

4 061 ATGCACGATC CATAGCAAGC CCAGCCCAAC CCAACCCAAC CCAACCCACC 
CCAGTGCAGC CAACTGGCAA 

TACGTGCTAG GTATCGTTCG GGTCGGGTTG GGTTGGGTTG GGTTGGGTGG 
GGTCACGTCG GTTGACCGTT 

4131 ATAGTCTCCA CCCCCGGCAC TATCACCGTG AGTTGTCCGC ACCACCGCAC 
GTCTCGCAGC CAAAAAAAAA 



BssHlI 



EcoRI 



EcoRI 




TATCAGAGGT GGGGGCCGTG ATAGTGGCAC TCAACAGGCG TGGTGGCGTG 
CAGAGCGTCG GTTTTTTTTT 



42 01 AAAAGAAAGA AAAAAAAGAA AAAGAAAAAC AGCAGGTGGG TCCGGGTCGT 
GGGGGCCGGA AAAGCGAGGA 

TTTTCTTTCT TTTTTTTCTT TTTCTTTTTG TCGTCCACCC AGGCCCAGCA 
CCCCCGGCCT TTTCGCTCCT 

42 71 GGATCGCGAG CAGCGACGAG GCCCGGGCCT CCCTCCGCTT CCAAAGAAAC 
GCCCCCCATC GCCACTATAT 

CCTAGCGCTC GTCGCTGCTC CGGGCCGGGA GGGAGGCGAA GGTTTCTTTG 
CGGGGGGTAG CGGT GATATA 

43 41 ACATACCCCC CCCTCTCCTC CCATCCCCCC AACCCTACCA CCACCACCAC 
CACCACCTCC TCCCCCCTCG 

TGTATGGGGG GGGAGAGGAG GGTAGGGGGG TTGGGATGGT GGTGGTGGTG 
GTGGTGGAGG AGGGGGGAGC 

4411 CTGCCGGACG ACGAGCTCCT CCCCCCTCCC CCTCCGCCGC CGGCGGTAAC 
CACCCCGCCC CTCTCCTCTT 

GACGGCCTGC TGCTCGAGGA GGGGGGAGGG GGAGGCGGCG GCGGCCATTG 
GTGGGGCGGG GAGAGGAGAA 

44 81 TCTTTCTCCG TTTTTTTTTT CGTCTCGGTC TCGATCTTTG GCCTTGGTAG 
TTTGGGTGGG CGAGAGCGGC 

AGAAAGAGGC AAAAAAAAAA GCAGAGCCAG AGCTAGAAAC CGGAACCATC 
AAACCCACCC GCTCTCGCCG 

4551 TTCGTCGCCC AGATCGGTGC GCGGGAGGGG CGGGATCTCG CGGCTGGCGT 
CTCCGGGCGT GAGTCGGCCC 

AAGCAGCGGG TCTAGCCACG CGCCCTCCCC GCCCTAGAGC GCCGACCGCA 
GAGGCGCGCA CTCAGCCGGG 



4621 GGATCCTCGC GGGGAATGGG GCTCTCGGAT GTAGATCTTC TTTCTTTCTT 
CTTTTTGTGG TAGAATTTGA 

CCTAGGAGCG CCCCTTACCC CGAGAGCCTA CATCTAGAAG AAAGAAAGAA 
GAAAAACACC ATCTTAAACT 

4 691 ATCCCTCAGC ATTGTTCATC GGTAGTTTTT CTTTTGATGA TTTGTGACAA 
ATGCAGCCTC GTGCGGAGCT 

TAGGGAGTCG TAACAAGTAG CCATCAAAAA GAAAAGTACT AAACACTGTT 
TACGTCGGAG CACGCCTCGA 

47 61 TTTTTGTAGG TAG 



BamHI 



Bglll 



AAAAACATCC ATG 




Kpn\{\) 
Ncol(l) 
Pstl(9l) 




pJQ5 

5034 bp 



Xbal(m) 

Xbal(U\) 
Smal(W) 
SEE promoter 
Sph I (523) 
Pstl(J7A) 
PPI 

Nco I (945) 
aleurain signal/NPIR 
Notl(lOU) 
Sal I (1114) 
^ccl (1115) 
Bam HI (1170) 
Glycosylation 
Intron 1 
FAE 

J/zoI(1552) 
ATcoI (1639) 
Redundancy (stop) 
KDEL 




nos 

BssEll (2095) 
5^HII (2119) 
C/a I (2146) 

dill (2151) 
Xba 1(2156) 
BssEll (2206) 



Sequence for pJQ5 



Ncol 



Kpnl 



1 CATGGGCCAG GTATAATTAT GGGATATCTC AAGCAAATAA TCGAAATATC 
ACCATTGGCT ACAATATCTG 

GTACCCGGTC CATATTAATA CCCTATAGAG TTCGTTTATT AGCTTTATAG 
TGGTAACCGA TGTTATAGAC 



71 AGCTCCGAGT TCTGACTGCA GTCTGGATGA CGCGTGTTGT ATCTAGAACT 
CTAGATAGCA CAGCCACAGC 

TCGAGGCTCA AGACTGACGT CAGACCTACT GCGCACAACA TAGATCTTGA 
GATCTATCGT GTCGGTGTCG 

141 ACCTACAGGA GTGCGACACT TGTGGACTGT AGTAGTGTTG GAGACGGAGC 
TCTTTCCTAC CTCCTGACGT 

TGGATGTCCT CACGCTGTGA ACAC CTGACA T CAT C AC AAC CTCTGCCTCG 
AGAAAGGATG GAGGACTGCA 

211 TGCCGCCGTT GTCCATTCCA ACGGCATCAC TCTCAACCAA TCACGCGCTC 
CCAACAAAAT ATCGTCCCCC 

ACGGCGGCAA CAGGTAAGGT TGCCGTAGTG AGAGTTGGTT AGTGCGCGAG 
GGTTGTTTTA TAGCAGGGGG 

2 81 ATGTCTTGGC GGAGAGAGAG TACATACATG CTGTCGCGCC GTTTTTGTCT 
GAATCTCGCT TCCACTGGCC 

TACAGAACCG CCTCTCTCTC ATGTATGTAC GACAGCGCGG CAAAAACAGA 
CTTAGAGCGA AGGTGACCGG 



351 AATCAGCTCA GCTCCCGGGA GCTCACTCAT TCAAGATCCC ATCGTCGTCG 
TCACCCCTGG CGTCATGGGA 

TTAGTCGAGT CGAGGGCCCT CGAGTGAGTA AGTTCTAGGG TAGCAGCAGC 
AGTGGGGACC GCAGTACCCT 

421 TGGAAAAGAA CCTCCGTTGC TCGGATGAGT CAGCCATATC CCCGAACAGA 
GTACTGCAAG ATAACCCAAT 

ACCTTTTCTT GGAGGCAACG AGCCTACTCA GTCGGTATAG GGGCTTGTCT 
CATGACGTTC TATTGGGTTA 



491 TCAGATTCCC CCAATAGAGA AAGT AT AG C A TGCTTTCGGG TTTTGTTTGG 
CTTAATTGAC TTTATTTTTG 

AGTCTAAGGG GGTTATCTCT TTCATATCGT ACGAAAGCCC AAAACAAACC 
GAATTAACTG AAATAAAAAC 



PstI 



Xbal 



Xbal 



Smal 



SphI 




561 TTGGAGTTGA ATGCTGATTT GTTGTGTAAA ATGCCCAACC ATCTGAATAT 
CGAGACGGAT AATAGGCTGG 

AACCTCAACT TACGACTAAA CAACACATTT TACGGGTTGG TAGACTTATA 
GCTCTGCCTA TTATGCGACC 

631 CTAATTAATT TATAGCAAGA TTCTGTAGTG CACATCGCAA ATATCTTTCT 
GGGCATTACA GCTGGAGGCT 

GATTAATTAA ATATCGTTCT AAGACATCAC GTGTAGCGTT TATAGAAAGA 
CCCGTAATGT CGACCTCCGA 



7 01 TCATCAGCCT GAAACACTCT GCAGAGCCTG AAGCAAGTGG TGAAGCGTGG 
CGATGAGATG GGTATAAAAC 

AGTAGTCGGA CTTTGTGAGA CGTCTCGGAC TTCGTTCACC ACTTCGCACC 
GCTACTCTAC CCATATTTTG 

771 CCCCGGCACC GGGACGCGAG CTGCCGCCTA CCAGTACCAT CTCGCCTCGC 
TCCCCCTGCG GGACGACCCA 

GGGGCCGTGG CCCTGCGCTC GAGGGCGGAT GGTCATGGTA GAGCGGAGCG 
AGGGGGACGG CCTGCTGGGT 

841 GTAAAATACT GTTGCCCACT CGCCGGCGAG ATGGMCGTGC ACAAGGAGGT 
SAACTTCGTS GCCTACCTCC 

CATTTTATGA CAACGGGTGA GCGGCCGCTC TACCKGCACG TGTTCCTCCA 
STTGAAGCAS CGGATGGAGG 



911 TGATGGTSCT CGGCCTCCTC TTGCTCGTST CCGCCATGGA GCACGTGGAC 
GCCAAGGCCT GCACCCKCGA 

ACTAGCASGA GCCGGAGGAG AACGAGCASA GGCGGTACCT CGTGCAGCTG 
CGGTTCCGGA CGTGGGMGCT 



981 GTGCGGCAAC CTCGGCTTCG GCATCTGCCC GGCGGCCGCC TCCACGCAGG 
GCATCTCCGA AGACCTCTAC 

CACGCCGTTG GAGCCGAAGC CGTAGACGGG CCGCCGGCGG AGGTGCGTCC 
CGTAGAGGCT TCTGGAGATG 



1051 AGCCGTTTAG TCGAAATGGC CACTATCTCC CAAGCTGCCT ACGCCGACCT 
GTGCAACATT CCGTCGACTA 

TCGGCAAATC AGCTTTACCG GTGATAGAGG GTTCGACGGA TGCGGCTGGA 
CACGTTGTAA GGCAGCTGAT 



PstI 



Ncol 



Not I 



Sail 



AccI 




BamHI 



1121 TTATCAAGGG AGAGAAAATT TACAATTCTC AAACTGACAT TAACGGATGG 
ATCCTCCGCG ACGACAGCAG 

AATAGTTCCC TCTCTTTTAA ATGTTAAGAG TTTGACTGTA ATTGCCTACC 

TAGGAGGCGC TGCTGTCGTC 

1191 CAAAGAAATA ATCACCGTCT TCCGTGGCAC TGGTAGTGAT ACGAATCTAC 
AACTCGATAC TAACTACACC 

GTTTCTTTAT TAGTGGCAGA AGGCACCGTG ACCATCACTA TGCTTAGATG 

TTGAGCTATG ATTGATGTGG 

12 61 CTCACGCCTT TCGACACCCT ACCACAATGC AACGGTTGTG AAGTACACGG 
TGGATATTAT ATTGGATGGG 

GAGTGCGGAA AGCTGTGGGA TGGTGTTACG TTGCCAACAC TTCATGTGCC 

ACCTATAATA TAACCTACCC 

13 31 TCTCCGTCCA GGACCAAGTC GAGTCGCTTG TCAAACAGCA GGTTAGCCAG 
TATCCGGACT ACGCGGTGAC 

AGAGGCAGGT CCTGGTTCAG GTCAGCGAAC AGTTTGTCGT CCAATCGGTC 

ATAGGCCTGA TGCGCGACTG 

1401 CGTGACCGGC CACKCCCTCG GCGCCTCCCT GGCGGCACTC ACTGCCGCCC 
AGCTGTCTGC GACATACGAC 

GCACTGGCCG GTGMGGGAGC CGCGGAGGGA CCGCCGTGAG TGACGGCGGG 

T C GACAGACG CTGTATGCTG 

1471 AAGATCCGCC TGTACACCTT CGGCGAACCG CGCAGCGGCA ATCAGGCCTT 
CGCGTCGTAC ATGAACGATG 

TTGTAGGCGG ACATGTGGAA GCCGCTTGGC GCGTCGCCGT TAGTCCGGAA 

GCGCAGCATG TACTTGCTAC 

Xhol 

1541 CCTTCCAAGC CTCGAGCCCA GATACGACGC AGTATTTCCG GGTCACTCAT 
GCCAACGACG GCATCCCAAA 

GGAAGGTTCG GAGCTCGGGT CTATGCTGCG TCATAAAGGC CCAGTGAGTA 

CGGTTGCTGC CGTAGGGTTT 

Ncol 



1611 CCTGCCCCCG GTGGAGCAGG GGTACGCCCA TGGCGGTGTA GAGTACTGGA 
GCGTTGATCC TTACAGCGCC 

GGACGGGGGC CACCTCGTCC CCATGCGGGT ACCGCCACAT CTCATGACCT 

CGCAACTAGG AATGTCGCGG 

16 81 CAGAACACAT TTGTCTGCAC TGGGGATGAA GTGCAGTGCT GTGAGGCCCA 

GGGCGGACAG GGTGTGAATA 

GTCTTGTGTA AACAGACGTG ACCCCTACTT CACGTCACGA CACTCCGGGT 

CCCGCCTGTC CCACACTTAT 

1751 ATGCGCACAC GACTTATTTT GGGATGACGA GCGGAGCCTG TACATGGTGA 
TCAGTCATTT CAGCCTCCCC 



TACGCGTGTG CTGAATAAAA CCCTACTGCT CGCCTCGGAC ATGTACCACT 
AGTCAGTAAA GTCGGAGGGG 



1821 GAGTGTACCA GGAAAGATGG ATGTCCTGGA GAGGGGGCCG CGTAACCACT 
GAAGGATGAG CTGTAAAGAA 

CTCACATGGT CCTTTCTACC TACAGGACCT CTCCCCCGGC GCATTGGTGA 
CTTCCTACTC GACATTTCTT 

18 91 GCAGATCGTT CAAACATTTG GCAATAAAGT TTCTTAAGAT TGAATCCTGT 
TGCCGGTCTT GCGATGATTA 

CGTCTAGCAA GTTTGTAAAC CGTTATTTCA AAGAATTCTA ACTTAGGACA 
ACGGCCAGAA CGCTACTAAT 

1961 TCATATAATT TCTGTTGAAT TACGTTAAGC ATGTAATAAT TAACATGTAA 
TGCATGACGT TATTTATGAG 

AGTATATTAA AGACAACTTA ATGCAATTCG TACATTATTA ATTGTACATT 
ACGTACTGCA ATAAATACTC 



2 031 ATGGGTTTTT ATGATTAGAG TCCCGCAATT ATACATTTAA TACGCGATAG 
AAAACAAAAT ATAGCGCGCA 

TACCCAAAAA TACTAATCTC AGGGCGTTAA TATGTAAATT ATGCGCTATC 
TTTTGTTTTA TATGGCGCGT 



2101 AACTAGGATA AATTATCGCG CGCGGTGTCA TCTATGTTAC TAGATCGATA 
AGCTTCTAGA GCGGCCGGTG 

TTGATCCTAT TTAATAGCGC GCGCCACAGT AGATACAATG ATCTAGCTAT 
TCGAAGATCT CGCCGGCCAC 



2171 GAGCTCCAAT TCGCCCTATA GTGAGTCGTA TTACGCGCGC TCACTGGCCG 
TCGTTTTACA ACGTCGTGAC 

CTCGAGGTTA AGC GGGATAT CACTCAGCAT AATGCGCGCG AGTGACCGGC 
AGCAAAATGT TGCAGCACTG 

2241 TGGGAAAACC CTGGCGTTAC CCAACTTAAT CGCCTTGCAG CACATCCCCC 
TTTCGCCAGC TGGCGTAATA 

ACGCTTTTGG GACCGCAATG GGTTGAATTA GCGGAACGTC GTGTAGGGGG 
AAAGCGGTCG ACCGCATTAT 

2311 GCGAAGAGGC CCGCACCGAT CGCCCTTCCC AACAGTTGCG CAGCCTGAAT 
GGCGAATGGG AGGCGCCCTG 

CGCTTCTCCG GGCGTGGCTA GCGGGAAGGG TTGTCAACGC GTCGGACTTA 

CCGCTTACCC TGCGCGGGAC 



BssHII 



Xbal 



BssHII 



Clal Hindi I I 



BssHII 




23 81 TAGCGGCGCA TTAAGCGCGG CGGGTGTGGT GGTTACGCGC AGCGTGACCG 
CTACACTTGC CAGCGCCCTA 

ATCGGCGCGT AATTCGCGCC GCCCACACCA CCAATGCGCG TCGCACTGGC 
GATGTGAACG GTCGCGGGAT 

2451 GCGCCCGCTC CTTTCGCTTT CTTCCCTTCC TTTCTCGCCA CGTTCGCCGG 
CTTTCCCCGT CAAGCTCTAA 

CGCGGGCGAG GAAAGCGAAA GAAGGGAAGG AAAGAGCGGT GCAAGCGGCC 
GAAAGGGGCA GTTCGAGATT 

2 521 ATCGGGGGCT CCCTTTAGGG TTCCGATTTA GTGCTTTACG GCACCTCGAC 
CCCAAAAAAC TTGATTAGGG 

TAGCCGCCGA GGGAAATCCC AAGGCTAAAT CACGAAATGC CGTGGAGCTG 
GGGTTTTTTG AACTAATCCC 

2591 TGATGGTTCA CGTAGTGGGC CATCGCCCTG ATAGACGGTT TTTCGCCCTT 
TGACGTTGGA GTCCACGTTC 

ACTACCAAGT GCATCACCCG GTAGCGGGAC TATCTGCCAA AAAGCGGGAA 
ACTGCAACCT CAGGTGCAAG 

2 661 TTTAATAGTG GACTCTTGTT CCAAACTGGA AC AAC AC TC A ACCCTATCTC 
GGTCTATTCT TTTGATTTAT 

AAATTATCAC CTGAGAACAA GGTTTGACCT TGTTGTGAGT TGGGATAGAG 
CCAGATAAGA AAACTAAATA 

2 731 AAGGGATTTT GCCGATTTCG GCCTATTGGT TAAAAAATGA GCTGATTTAA 
CAAAAATTTA ACGCGAATTT 

TTCCCTAAAA CGGCTAAAGC CGGATAACCA ATTTTTTACT CGACTAAATT 
GTTTTTAAAT TGCGCTTAAA 

2801 TAACAAAATA TTAACGCTTA CAATTTAGGT GGCACTTTTC GGGGAAATGT 
GCGCGGAACC CCTATTTGTT 

ATTGTTTTAT AATTGCGAAT GTTAAATCCA CCGTGAAAAG CCCCTTTACA 
CGCGCCTTGG GGATAAACAA 

2 8 71 TATTTTTCTA AATACATTCA AATATGTATC CGCTCATGAG ACAATAACCC 
TGATAAATGC TTCAATAATA 



ACTATTTACG AAGTTATTAT 

2 941 TTGAAAAAGG AAGAGTATGA GTATTCAACA TTTCCGTGTC GCCCTTATTC 
CCTTTTTTGC GGCATTTTGC 

AACTTTTTCC TTCTCATACT CATAAGTTGT AAAGGCACAG CGGGAATAAG 
GGAAAAAACG CCGTAAAACG 

3 011 CTTCCTGTTT TTGCTCACCC AGAAACGCTG GTGAAAGTAA AAGATGCTGA 
AGATCAGTTG GGTGCACGAG 

GAAGGACAAA AACGAGTGGG TCTTTGCGAC CACTTTCATT TTCTACGACT 
TCTAGTCAAC CCACGTGCTC 

3 081 TGGGTTACAT CGAACTGGAT CTCAACAGCG GTAAGATCCT TGAGAGTTTT 
CGCCCCGAAG AACGTTTTCC 



ATAAAAAGAT TTATGTAAGT TTATACATAG GCGAGTACTC TGTTATTGGG 




ACCCAATGTA GCTTGACCTA GAGTTGTCGC CATTCTAGGA ACTCTCAAAA 
GCGGGGCTTC TTGCAAAAGG 

3151 AATGATGAGC ACTTTTAAAG TTCTGCTATG TGGCGCGGTA TTATCCCGTA 
TTGACGCCGG GCAAGAGCAA 

TTACTACTCG TGAAAATTTC AAGACGATAC ACCGCGCCAT AATAGGGCAT 
AACTGCGGCC CGTTCTCGTT 

3221 CTCGGTCGCC GCATACACTA TTCTCAGAAT GACTTGGTTG AGTACTCACC 
AGT CACAGAA AAGCATCTTA 

GAGCCAGCGG CGTATGTGAT AAGAGTCTTA CTGAACCAAC TCATGAGTGG 

TCAGTGTCTT TTCGTAGAAT 

32 91 CGGATGGCAT GACAGTAAGA GAATTATGCA GTGCTGCCAT AACCATGAGT 
GATAACACTG CGGCCAACTT 

GCCTACCGTA CTGTCATTCT CTTAATACGT CACGACGGTA TTGGTACTCA 
CTATTGTGAC GCCGGTTGAA 

33 61 ACTTCTGACA ACGATCGGAG GACCGAAGGA GCTAACCGCT TTTTTGCACA 
ACATGGGGGA TCATGTAACT 

TGAAGACTGT TGCTAGCCTC CTGGCTTCCT CGATTGGCGA AAAAACGTGT 

TGTACCCCCT AGTACATTGA 

3 431 CGCCTTGATC GTTGGGAACC GGAGCTGAAT GAAGCCATAC CAAACGACGA 
GCGTGACAGC ACGATGCCTG 

GCGGAACTAG CAACCCTTGG CCTCGACTTA CTTCGGTATG GTTTGCTGCT 

CGCACTGTGG TGCTACGGAC 

3 501 TAGCAATGGC AACAACGTTG CGCAAACTAT TAACTGGCGA ACTACTTACT 
CTAGCTTCCC GGCAACAATT 

ATCGTTACCG TTGTTGCAAC GCGTTTGATA ATTGACCGCT TGATGAATGA 
GATCGAAGGG CCGTTGTTAA 

3 571 AATAGAC TGG ATGGAGGCGG ATAAAGTTGC AGGACCACTT CTGCGCTCGG 
CCCTTCCGGC TGGCTGGTTT 

TTATCTGACC TACCTCCGCC TATTTCAACG TCCTGGTGAA GACGGGAGCC 
GGGAAGGCCG ACCGACCAAA 

3 641 ATTGCTGATA AATCTGGAGC CGGTGAGCGT GGGTCTCGCG GTATCATTGC 

AGCACTGGGG CCAGATGGTA 

TAACGACTAT TTAGACCTCG GCCACTCGCA CCCAGAGCGC CATAGTAACG 

TCGTGACCCC GGTCTACCAT 

3 711 AGCCCTCCCG TATCGTAGTT ATCTACACGA CGGGGAGTGA GGCAACTATG 
GATGAACGAA ATAGACAGAT 

TCGGGAGGGC ATAGCATCAA TAGATGTGCT GCCCCTCAGT CCGTTGATAC 

CTACTTGCTT TATCTGTCTA 

3 781 CGCTGAGATA GGTGCCTCAC TGATTAAGCA TTGGTAACTG TCAGACCAAG 
T TTAC T CAT A TATACTTTAG 

GCGACTCTAT CCACGGAGTG ACTAATTCGT AACCATTGAC AGTCTGGTTC 
AAATGAGTAT ATATGAAATC 



3 851 ATTGATTTAA AACTTCATTT TTAATTTAAA AGGATCTAGG TGAAGATCCT 
TTTTGATAAT CTCATGACCA 

TAACTAAATT TTGAAGTAAA AATTAAATTT TCCTAGATCC ACTTCTAGGA 
AAAACTATTA GAGTACTGGT 

3 921 AAATCCCTTA ACGTGAGTTT TCGTTCCACT GAGCGTCAGA CCCCGTAGAA 
AAGATCAAAG GATCTTCTTG 

TTTAGGGAAT TGCACTCAAA AGCAAGGTGA CTCGCAGTCT GGGGCATCTT 
TTCTAGTTTC CTAGAAGAAC 

3 991 AGATCCTTTT TTTCTGCGCG TAATCTGCTG CTTGCAAACA AAAAAACCAC 
CGCTACCAGC GGTGGTTTGT 

TCTAGGAAAA AAAGACGCGC ATTAGACGAC GAACGTTTGT TTTTTTGGTG 
GCGATGGTCG CCACCAAACA 

40 61 TTGCCGGATC AAGAGCTACC AACTCTTTTT CCGAAGGTAA CTGGCTTCAG 
CAGAGCGCAG ATACCAAATA 

AACGGCCTAG TTCTCGATGG TTGAGAAAAA GGCTTCCATT GACCGAAGTC 

GTCTCGCGTC TATGGTTTAT 

4131 CTGTCCTTCT AGTGTAGCCG TAGTTAGGCC ACCACTTCAA GAACTCTGTA 
GCACCGCCTA CATACCTCGC 

GACAGGAAGA TCACATCGGC ATCAATCCGG TGGTGAAGTT CTTGAGACAT 
CGTGGCGGAT GTATGGAGCG 

42 01 TCTGCTAATC CTGTTACCAG TGGCTGCTGC CAGTGGCGAT AAGTCGTGTC 
TTACCGGGTT GGACTCAAGA 

AGACGATTAG GACAATGGTC ACCGACGACG GTCACCGCTA TTCAGCACAG 
AATGGCCCAA CCTGAGTTCT 

4271 CGATAGTTAC CGGATAAGGC GCAGCGGTCG GGCTGAACGG GGGGTTCGTG 
CACACAGCCC AGCTTGGAGC 

GCTATCAATG GCCTATTCCG CGTCGCCAGC CCGACTTGCC CCCCAAGCAC 

GTGTGTCGGG TCGAACCTCG 

4341 GAACGACCTA CACCGAACTG AGATACCTAC AGCGTGAGCT ATGAGAAAGC 
GCCACGCTTC CCGAAGGGAG 

CTTGCTGGAT GTGGCTTGAC TCTATGGATG TCGCACTGGA TACTCTTTCG 

CGGTGCGAAG GGCTTCCCTC 

4411 AAAGGCGGAC AGGTATCCGG TAAGCGGCAG GGTCGGAACA GGAGAGCGCA 
CGAGGGAGCT TCCAGGGGGA 

TTTCCGCCTG TCCATAGGCC ATTCGCCGTC CCAGGCTTGT CCTCTCGCGT 
GCTCCCTGGA AGGTCCCCCT 

4481 AACGCCTGGT ATCTTTATAG TCCTGTCGGG TTTCGCCACC TCTGACTTGA 
GCGTCGATTT TTGTGATGCT 

TTGCGGACCA TAGAAAT AT C AGGACAGCCC AAAGCGGTGG AGACTGAACT 

CGCAGCTAAA AACACTACGA 

4551 CGTCAGGGGG GCGGAGCCTA TGGAAAAACG CCAGCAACGC GGCCTTTTTA 
CGGTTCCTGG CCTTTTGCTG 

GCAGTCCCCC CGCCTCGGAT ACCTTTTTGC GGTCGTTGCG CCGGAAAAAT 

GCCAAGGACC GGAAAACGAC 



4621 GCCTTTTGCT CACATGTTCT TTCCTGCGTT ATCCCCTGAT TCTGTGGATA 
ACCGTATTAC CGCCTTTGAG 

CGGAAAACGA GTGTACAAGA AAGGACGCAA TAGGGGACTA AGACACCTAT 
TGGCATAATG GCGGAAACTC 

46 91 TGAGCTGATA CCGCTCGCCG CAGCCGAACG ACCGAGCGCA GCGAGTCAGT 
GAGCGAGGAA GCGGAAGAGC 

ACTCGACTAT GGCGAGCGGC GTCGGCTTGC TGGCTCGCGT CGCTCAGTCA 
CTCGCTCCTT CGCCTTCTCG 

4761 GCCCAATACG CAAACCGCCT CTCCCCGCGC GTTGGCCGAT TCATTAATGC 
AGCTGGCACG ACAGGTTTCC 

CGGGTTATGC GTTTGGCGGA GAGGGGCGCG CAACCGGCTA AGTAATTACG 

TCGACCGTGC TGTCCAAAGG 

4831 CGACTGGAAA GCGGGCAGTG AGCGCAACGC AATTAATGTG AGTTAGCTCA 
CTCATTAGGC ACCCCAGGCT 

GCTGACCTTT CGCCCGTCAC TCGGGTTGCG TTAATTACAC TCAATCGAGT 

GAGTAATCCG TGGGGTCCGA 

49 01 TTACACTTTA TGCTTCCGGC TCGTATGTTG TGTGGAATTG TGAGCGGATA 
ACAATTTCAC ACAGGAAACA 

AATGTGAAAT ACGAAGGCCG AGCATACAAC ACACCTTAAC ACTCGCCTAT 
TGTTAAAGTG TGTCCTTTGT 



4 971 GCTATGACCA TGATTACGCC AAGCGCGCAA TTAACCCTCA CTAAAGGGAA 
CAAAAGCTGG GTAC 

CGATACTGGT ACTAATGCGG TTCGCGCGTT AATTGGGAGT GATTTCGCTT 

GTTTTCGACC CATG 



Ncol 



BssHII 



Kpnl 




Hin&lll (2) 
Bgl II (4831) Ncol(9) 
B am HI (4799) M>fl(129) 




Bss HII (1272) 

fl ori 

amp 



Fr v ^3 A- 



Sequence for p J06 



Hindi I I Ncol 



1 AAGCTTACCA TGGCCCACGC CCGCGTCCTC CTCCTGGCGC TCGCCGTGCT 
GGCCACGGCC GCCGTCGCCG 

TTCGAATGGT ACGGGGTGCG GGCGCAGGAG GAGGACCGCG AGCGGCACGA 
CCGGTGCCGG CGGCAGCGGC 



Not I 



71 TCGCCTCCTC CTCCTCCTTC GCCGACTCCA ACCCGATCCG GCCCGTCACC 
GACCGCGCGG CCGCCTCCAC 

AGCGGAGGAG GAGGAGGAAG CGGCTGAGGT TGGGCTAGGC CGGGCAGTGG 
CTGGCGCGCC GGCGGAGGTG 

141 GCAGGGCATC TCCGAAGACC TCTACAGCCG TTTAGTCGAA ATGGCCACTA 
TCTCCCAAGC TGCCTACGCC 

CGTCCCGTAG AGGCTTCTGG AGATGTCGGC AAATCAGCTT TACCGGTGAT 
AGAGGGTTCG ACGGATGCGG 

Sail 



AccI 



211 GACCTGTGCA ACATTCCGTC GACTATTATC AAGGGAGAGA AAATTTACAA 
TTCTCAAACT GACATTAACG 

CTGGACACGT TGTAAGGCAG CTGATAATAG TTCCCTGTCT TTTAAATGTT 
AAGAGTTTGA CTGTAATTGC 

BamHI 



281 GATGGATCGT CCGCGACGAC AGCAGCAAAG AAATAATCAC CGTCTTCCGT 
GGCACTGGTA GTGATACGAA 

CTACCTAGGA GGCGCTGCTG TCGTCGTTTC TTTATTAGTG GCAGAAGGCA 

CCGTGACCAT CACTATGCTT 

3 51 TCTACAACTC GATACTAACT ACACCCTGAC GCCTTTCGAC ACCCTACCAC 
AATGCAACGG TTGTGAAGTA 

AGATGTTGAG CTATGATTGA TGTGGGAGTG CGGAAAGCTG TGGGATGGTG 
TTACGTTGCC AACACTTCAT 

421 GACGGTGGAT ATTATATTGG ATGGGTCTCC GTCCAGGACC AAGTCGAGTC 
GCTTGTCAAA CAGCAGGTTA 

GTGCCACCTA TAATATAACC TACCCAGAGG CAGGTCCTGG TTCAGCTCAG 
CGAACAGTTT GTCGTCCAAT 

491 GCCAGTATCC GGACTACGCG CTGACCGTGA CCGGCCACKC CCTCGGCGCC 
TCCCTGGCGG CACTCACTGC 

CGGTCATAGG CCTGATGGGC GACTGGCACT GGCCGGTGMG GGAGCCGCGG 
AGGGACCGCC GTGAGTGACG 



561 CGCCCAGCTG TCTGCGACAT ACGACAACAT CCGCCTGTAC ACCTTCGGCG 
AACCGCGCAG CGGCAATCAG 

GCGGGTCGAC AGACGCTGTA TGCTGTTGTA GGCGGACATG TGGAAGCCGC 

TTGGCGCGTC GCCGTTAGTC 

Xhol 



631 GCCTTCGCGT CGTACATGAA CGATGCCTTC GAAGCCTCGA GCCCAGATAC 
GACGCAGTAT TTCCGGGTCA 

CGGAAGCGCA GCATGTACTT GCTACGGAAG GTTCGGAGCT CGGGTCTATG 

CTGCGTCATA AAGGCCCAGT 



Ncol 



701 CTCATGCCAA CGACGGCATC CCAAACCTGC CCCCGGTGGA GCAGGGGTAC 
GCCCATGGCG GTGTAGAGTA 

GAGTACGGTT GCTGCCGTAG GGTTTGGACG GGGGCCACCT CGTCCCCATG 

CGGGTACGGC CACATCTCAT 

771 CTGGAGCGTT GATCCTTACA GCGCCCAGAA CACATTTGTC TGCACTGGGG 
ATGAAGTGCA GTGCTGTGAG 

GACCTCGCAA CTAGGAATGT CGCGGGTCTT GTGTAAACAG ACGTGACCCC 

TACTTCACGT CAGGACACTC 



SphI 



841 GCCCAGGGCG GACAGGGTGT GAATAATGCG CACACGACTT ATTTTGGGAT 
GACGAGCGGC GCATGCACCT 

CGGGTCCCGC CTGTCCCACA CTTATTACGC GTGTGCTGAA TAAAACC CTA 

CTGCTCGCCG CGTACGTGGA 



Not I 



911 GGCCGGTCGC GGCCGCGGAA ACCACTGAAG GATGAGCTGT AAAGAAGCAG 
ATCGTTCAAA CATTTGGCAA 

CCGGCCAGCG CCGGCGCCTT TGGTGACTTC CTACTCGACA TTTCTTCGTC 
TAGCAAGTTT GTAAACCGTT 

981 TAAAGTTTCT TAAGATTGAA TCCTGTTGCC GGTCTTGCGA TGATTATCAT 
ATAATTTCTG TTGAATTACG 

ATTTCAAAGA ATTCTAACTT AGGACAACGG CCAGAAGGCT AGTAATAGTA 
TATTAAAGAC AACTTAATGC 

1051 TTAAGCATGT AATAATTAAC ATGTAATGCA TGACGTTATT TATGAGATGG 
GTTTTTATGA TTAGAGTCCC 

AATTCGTACA TTATTAATTG TACATTACGT ACTGCAATAA ATACTCTACC 

CAAAAATACT AATCTCAGGG 



BssHII 



BssHII 



1121 GCAATTATAC ATTTAATACG CGATAGAAAA CAAAATATAG CGCGCAAACT 
AGGATAAATT ATCGCGCGCG 

CGTTAATATG TAAATTATGC GCTATCTTTT GTTTTATATC GCGCGTTTGA 
TCCTATTTAA TAGCGCGCGC 



1191 GTGTCATCTA TGTTACTAGA TCGATAAGCT TCTAGAGCGG CCGGTGGAGC 
TCCAATTCGC CCTATAGTGA 

CACAGTAGAT ACAATGATCT AGCTATTCGA AGATCTCGCC GGCCACCTCG 
AGGTTAAGCG GGATATCACT 



1261 GTCGTATTAC GCGCGCTCAC TGGCCGTCGT TTTACAACGT CGTGACTGGG 
AAAACCCTGG CGTTACCCAA 

CAGCATAATG CGCGCGAGTG ACCGGCAGCA AAATGTTGCA GCACTGACCC 
TTTTGGGACC GCAATGGGTT 

13 31 CTTAATCGCC TTGCAGCACA TCCCCCTTTC GCCAGCTGGC GTAATAGCGA 
AGAGGC CCGC ACCGATCGCC 

GAATTAGCGG AACGTCGTGT AGGGGGAAAG CGGTCGACCG CATTATCGCT 

TCTCCGGGCG TGGCTAGCGG 

1401 CTTCCCAACA GTTGGGCAGG CTGAATGGCG AATGGGACGC GCGCTGTAGC 
GGCGCATTAA GCGCGGCGGG 

GAAGGGTTGT CAACGCGTCG GACTTACCGC TTACCCTGCG CGGGACATCG 

GCGCGTAATT CGCGCCGCCC 

1471 TGTGGTGGTT ACGCGCAGCG TGACCGCTAC ACTTGCCAGC GCCCTAGCGC 
CCGCTCCTTT CGCTTTCTTC 

ACACCACCAA TGCGCGTCGC ACTGGCGATG TGAACGGTCG CGGGATCGCG 

GGCGAGGAAA GCGAAAGAAG 

1541 CCTTCCTTTC TCGCCACGTT CGCCGGCTTT CCCCGTCAAG CTCTAAATCG 
GGGGCTCCCT TTAGGGTTCC 

GGAAGGAAAG AGCGGTGCAA GCGGCGGAAA GGGGCAGTTC GAGATTTAGC 
CCCCGAGGGA AATCCCAAGG 

1611 GATTTAGTGC TTTACGGCAC CTCGACCCCA AAAAACTTGA TTAGGGTGAT 
GGTTCACGTA GTGGGCCATC 

CTAAATCACG AAATGCCGTG GAGCTGGGGT TTTTTGAACT AATCCCACTA 
CCAAGTGCAT CACCCGGTAG 

1681 GCCCTGATAG ACGGTTTTTC GCCCTTTGAC GTTGGAGTCC ACGTTCTTTA 
ATAGTGGACT CTTGTTCCAA 

CGGGACTATC TGCCAAAAAG CGGGAAACTG CAACCTCAGG TGCAAGAAAT 
TATCACCTGA GAACAAGGTT 



Xbal 



Clal Hindi II 



BssHII 




1751 ACTGGAACAA CACTCAACCC TATCTCGGTC TATTCTTTTG ATTTATAAGG 
GATTTTGCCG ATTTCGGCCT 

TGACCTTGTT GTGAGTTGGG ATAGAGCCAG ATAAGAAAAC TAAATATTCC 
CTAAAACGGC TAAAGCCGGA 

1821 ATTGGTTAAA AAATGAGCTG ATTTAACAAA AATTTAACGC GAATTTTAAC 
AAAATATTAA CGGTTACAAT 

TAAG CAATTT TTTACTCGAC TAAATTGTTT TTAAATTGCG CTTAAAATTG 
TTTTATAATT GCGAATGTTA 

1891 TTAGGTGGCA CTTTTCGGGG AAATGTGCGC GGAACCCCTA TTTGTTTATT 
TTTCTAAATA CATTCAAATA 

AATCCACCGT GAAAAGCCCC TTTACACGCG CCTTGGGGAT AAACAAATAA 

AAAGATTTAT GTAAGTTTAT 

1961 TGTATCCGCT CATGAGACAA TAACCCTGAT AAATGCTTCA ATAATATTGA 
AAAAGGAAGA GTATGAGTAT 

ACATAGGCGA GTACTCTGTT ATTGGGACTA TTTACGAAGT TATTATAACT 
TTTTCCTTCT CATACTCATA 

2 031 TCAACATTTC CGTGTCGCCC TTATTCCCTT TTTTGCGGCA TTTTGCCTTC 
CTGTTTTTGC TCACCCAGAA 

AGTTGTAAAG GCACAGCGGG AATAAGGGAA AAAACGCCGT AAAACGGAAG 
GACAAAAACG AGTGGGTCTT 

2101 ACGCTGGTGA AAGTAAAAGA TGCTGAAGAT CAGTTGGGTG CACGAGTGGG 
TTACATCGAA CTGGATCTCA 

TGCGACCACT TTCATTTTCT ACGACTTCTA GTCAACCCAC GTGCTCACCC 
AATGTAGCTT GACCTAGAGT 

2171 ACAGCGGTAA GATCCTTGAG AGTTTTCGCC CCGAAGAACG TTTTCCAATG 
ATGAGCACTT TTAAAGTTCT 

TGTCGCCATT CTAGGAACTC TCAAAAGCGG GGCTTCTTGC AAAAGGTTAC 

TACTCGTGAA AATTT CAAGA 

2241 GCTATGTGGC GCGGTATTAT CCCGTATTGA CGCCGGGCAA GAGCAACTCG 
GTCGCCGGAT ACACTATTCT 

CGATACACCG CGCCATAATA GGGCATAACT GCGGCCCGTT CTCGTTGAGC 
CAGCGGCGTA TGTGATAAGA 

2311 CAGAATGACT TGGTTGAGTA CTCACCAGTC AC AGAAAAG C ATCTTACGGA 
TGGCATGACA GTAAGAGAAT 

GTCTTACTGA ACCAACTCAT GAGTGGTCAG TGTCTTTTCG TAGAATGCCT 
ACCGTACTGT CATTCTCTTA 

23 81 TATGCAGTGC TGCCATAACC ATGAGTGATA ACACTGCGGC CAACTTACTT 
CTGACAACGA TCGGAGGACC 

ATACGTCACG ACGGTATTGG TACTCACTAT TGTGACGCCG GTTGAATGAA 
GACTGTTGCT AGCCTCCTGG 

2451 GAAGGAGCTA ACCGCTTTTT TGCACAACAT GGGGGATCAT GTAACTCGCC 
TTGATCGTTG GGAACCGGAG 




CTTCCTCGAT TGGCGAAAAA ACGTGTTGTA CCCCCTAGTA CATTGAGCGG 
AACTAGCAAC CCTTGGCCTC 



2 521 CTGAATGAAG CCATACCAAA CGACGAGCGT GACACCACGA TGCCTGTAGC 
AATGGCAACA ACGTTGCGCA 

GACTTACTTC GGTATGGTTT GCTGCTCGCA CTGTGGTGCT ACGGACATCG 
TTACCGTTGT TGCAACGCGT 

2591 AACTATTAAC TGGCGAACTA CTTACTCTAG CTTCCCGGCA ACAATTAATA 
GACTGGATGG AGGCGGATAA 

TTGATAATTG ACCGCTTGAT GAATGAGATC GAAGGGCCGT TGTTAATTAT 
CTGACCTACC TCCGCCTATT 

2 661 AGTTGCAGGA CCACTTCTGC GCTCGGCCCT TCCGGCTGGC TGGTTTATTG 
CTGATAAATC TGGAGCCGGT 

TCAACGTCCT GGTGAAGACG CGAGCCGGGA AGGCCGACCG ACCAAATAAC 
GACTATTTAG ACCTCGGCCA 

2 731 GAGCGTGGGT CTCGCGGTAT CATTGCAGCA CTGGGGCCAG ATGGTAAGCC 
CTCCCGTATC GTAGTTATCT 

CTCGCACCCA GAGCGCCATA GTAACGTCGT GACCCCGGTC TACCATTCGG 
GAGGGCATAG CATCAATAGA 

2 801 ACACGACGGG GAGTCAGGGA ACTATGGATG AACGAAATAG ACAGATCGCT 
GAGATAGGTG CCTCACTGAT 

TGTGCTGCCC CTCAGTCCGT TGATACCTAC TTGCTTTATC TGTCTAGCGA 
CTCTATCCAC GGAGTGACTA 

2 871 TAAGCATTGG TAACTGTCAG ACCAAGTTTA CTCATATATA CTTTAGATTG 
ATTTAAAACT TCATTTTTAA 

ATTCGTAACC ATTGACAGTC TGGTTCAAAT GAGTATATAT GAAATCTAAC 
TAAATTTTGA AGTAAAAATT 

2 941 TTTAAAAGGA TCTAGGTGAA GATCCTTTTT GATAATCTCA TGACCAAAAT 
CGCTTAACGT GAGTTTTCGT 

AAATTTTCCT AGATCCACTT CTAGGAAAAA CTATTAGAGT ACTGGTTTTA 
GGGAATTGCA CTCAAAAGCA 

3 011 TCCACTGAGC GTCAGACCCC GTAGAAAAGA TCAAAGGATC TTCTTGAGAT 
CCTTTTTTTC TGGGCGTAAT 

AGGTGACTCG CAGTCTGGGG CATCTTTTCT AGTTTCCTAG AAGAACTCTA 
GGAAAAAAAG ACGCGCATTA 

3 081 CTGCTGCTTG CAAACAAAAA AACCACCGCT ACCAGCGGTG GTTTGTTTGC 
CGGATCAAGA GCTACCAACT 

GACGACGAAC GTTTGTTTTT TTGGTGGCGA TGGTCGCCAC CAAACAAACG 
GCCTAGTTCT CGATGGTTGA 

3151 CTTTTTCCGA AGGTAACTGG CTTCAGCAGA GCGCAGATAC CAAATACTGT 
CCTTCTAGTG TAGCCGTAGT 

GAAAAAGGCT TCCATTGACC GAAGTCGTCT CGCGTCTATG GTTTATGACA 
GGAAGATCAC ATCGGCATCA 




3221 TAGGCCACCA CTTCAAGAAC TCTGTAGCAC CGCCTACATA CCTCGCTCTG 
CTAATCCTGT TACCAGTGGC 

ATCCGGTGGT GAAGTTCTTG AGACATCGTG GCGGATGTAT GGAGCGAGAC 

GATTAGGACA ATGGTCACCG 

3 2 91 TGCTGCCAGT GGCGATAAGT CGTGTCTTAC CGGGTTGGAC TCAAGACGAT 
AGTTACCGGA TAAGGCGCAG 

ACGACGGTCA CCGCTATTCA GCACAGAATG GCCCAACCTG AGTTCTGCTA 
TCAATGGCCT ATTCCGCGTC 

33 61 CGGTCGGGCT GAACGGGGGG TTCGTGCACA CAGCCCAGCT TGGAGCGAAC 
GACCTACACC GAACTGAGAT 

GCCAGCGCGA CTTGCCCCCC AAGCACGTGT GTCGGGTCGA ACCTCGCTTG 
CTGGATGTGG CTTGACTCTA 

3431 ACCTACAGCG TGAGCTATGA GAAAGGGCCA CGCTTCCCGA AGGGAGAAAG 

GCGGACAGGT ATCCGGTAAG 

TGGATGTCGC ACTCGATACT CTTTCGCGGT GCGAAGGGCT TCCCTCTTTC 

CGGCTGTCCA TAGGCCATTC 

3 501 CGGCAGGGTC GGAACAGGAG AGCGCACGAG GGAGCTTCCA GGGGGAAACG 
CCTGGTATCT TTATAGTCCT 

GCCGTCCCAG CCTTGTCCTC TCGCGTGCTC CCTCGAAGGT CCCCCTTTGC 
GGACCATAGA AATATCAGGA 

3 571 GTCGGGTTTC GCCACCTCTG ACTTGAGCGT CGATTTTTGT GATGCTCGTC 
AGGGGGGCGG AGCCTATGGA 

GAGCCCAAAG CGGTGGAGAC TGAACTCGCA GCTAAAAACA GTACGAGCAG 

TCCCCCCGCC TCGGATACCT 

3 641 AAAACGCCAG CAACGCGGCC TTTTTACGGT TCCTGGCCTT TTGCTGGCCT 
TTTGCTCACA TGTTCTTTCC 

TTTTGCGGTC GTTGCGCCGG AAAAATGCCA AGGACCGGAA AACGACCGGA 

AAACGAGTGT ACAAGAAAGG 

3 711 TGCGTTATCC CCTGATTCTG TGGATAACCG TATTACCGCC TTTGAGTGAG 
CTGATACCGC TCGCCGCAGC 

ACGCAATAGG GGACTAAGAC ACCTATTGGC ATAATGGCGG AAACTCACTC 

GACTATGGCG AGCGGCGTCG 

3 781 CGAACGACCG AGCGCAGCGA GTCAGTGAGC GAGGAAGCGG AAGAGCGCCC 
AATACGCAAA CCGCCTCTCC 

GCTTGCTGGC TCGCGTCGCT CAGTCACTCG CTCCTTCGCC TTCTCGCGGG 

TTATGCGTTT GGGGGAGAGG 

3 851 CCGCGCGTTG GCCGATTCAT TAATGCAGCT GGCACGACAG GTTTCCCGAG 
TGGAAAGCGG GCAGTGAGCG 

GGCGCGCAAC CGGGTAAGTA ATTACGTCGA CCGTGCTGTC CAAAGGGCTG 

ACCTTTCGCC CGTCACTCGC 

3 921 CAACGCAATT AATGTGAGTT AGCTCACTCA TTAGGCACCC CAGGCTTTAC 
ACTTTATGCT TCCGGCTCGT 

GTTGCGTTAA TTACACTCAA TCGAGTGAGT AATCCGTGGG GTCCGAAATG 

TGAAATACGA AGGCCGAGCA 



BssHII 



3 991 ATGTTGTGTG GAATTGTGAG CGGATAACAA TTTCACACAG GAAACAGCTA 
TGACCATGAT TACGCCAAGC 

TACAACACAC CTTAACACTC GCCTATTGTT AAAGTGTGTC CTTTGTCGAT 

ACTGGTACTA ATGCGGTTCG 

BssHII EcoRI 



4 061 GCGCAATTAA CCCTCACTAA AGGGAACAAA AGCTGGAATT CCACAATGAA 

CAATAATAAG ATTAAAATAG 

CGCGTTAATT GGGAGTGATT TCCCTTGTTT TCGACCTTA& GGTGTTACTT 

GTTATTATTC TAATTTTATC 

4131 CTTGCCCCCG TTGCAGCGAT GGGTATTTTT TCTAGTAAAA TAAAAGATAA 

ACTTAGACTC AAAACATTTA 

GAACGGGGGC AACGTCGCTA CCCATAAAAA AGATCATTTT ATTTTCTATT 

TGAATCTGAG TTTTGTAAAT 

42 01 CAAAAACAAC CCCTAAAGTC CTAAAGCCCA AAGTGCTATG CACGATCCAT 
AGCAAGCCCA GCCCAACCCA 

GTTTTTGTTG GGGATTTCAG GATTTCGGGT TTCACGATAG GTGCTAGGTA 

TCGTTCGGGT CGGGTTGGGT 

4271 ACCCAACCCA ACCCACCCCA GTGCAGCCAA CTGGCAAATA GTCTCCACCC 
CCGGCACTAT CACCGTGAGT 

TGGGTTGGGT TGGGTGGGGT CACGTCGGTT GACCGTTTAT CAGAGGTGGG 

GGC CGTGATA GTGGCACTCA 

4341 TGTCCGCACC ACCGCACGTC TCGCAGCCAA AAAAAAAAAA AGAAAGAAAA 
AAAAGAAAAA GAAAAACAGC 

ACAGGCGTGG TGGCGTGCAG AGCGTCGGTT TTTTTTTTTT TCTTTCTTTT 

TTTTCTTTTT CTTTTTGTCG 

4411 AGGTGGGTCC GGGTCGTGGG GGCCGGAAAA G C GAGGAGGA TCGCGAGCAG 

CGACGAGGCC CGGCCCTCCC 

TCCACCCAGG CGCAGCACCC CCGGCCTTTT CGCTCCTCCT AGCGCTCGTC 

GCTGCTCCGG GCCGGGAGGG 

4481 TCCGCTTCCA AAGAAACGCC CCCCATCGCC ACTATATACA TACCCCCCCC 
TCTCCTCCCA TCCCCCCAAC 

AGGGGAAGGT TTCTTTGCGG GGGGTAGCGG TGATATATGT ATGGGGGGGG 

AGAGGAGGGT AGGGGGGTTG 

4 551 CCTACCACCA CCACCACCAC CACCTCCTCC CCCCTCGCTG CCGGACGACG 
AGCTCCTCCC CCCTCCCCCT 

GGATGGTGGT GGTGGTGGTG GTGGAGGAGG GGGGAGCGAC GGCCTGCTGC 

TCGAGGAGGG GGGAGGGGGA 

4 621 CCGCCGCCGC CGGTAACCAC CCCGCCCCTC TCCTCTTTCT TTCTCCGTTT 
TTTTTTTCGT CTCGGTCTCG 



GGCGGCGGCG GCCATTGGTG 
AAAAAAAGCA GAGCCAGAGC 

4 691 ATCTTTGGCC TTGGTAGTTT 
TCGGTGCGCG GGAGGGGCGG 

TAGAAACCGG AAC CAT C AAA 
AGCCACGCGC CCTCCCCGCC 



Bglll 



4761 GATCTCGCGG CTGGCGTCTC 
GAATGGGGCT CTCGGATGTA 

CTAGAGCGCC GACCGCAGAG 
CTTACCCCGA GAGCCTACAT 

Bglll 



4831 GATCTTCTTT CTTTCTTCTT 
GTTCATCGGT AGTTTTTCTT 

CTAGAAGAAA GAAAGAAGAA 
CAAGTAGCCA TCAAAAAGAA 

4 901 TTCATGATTT GTGACAAATG 
AAGTACTAAA CACTGTTTAC 



GGGCGGGGAG AGGAGAAAGA AAGAGGCAAA 

GGGTGGGCGA GAGCGGCTTC GTCGCGCAGA 
CCCACCCGCT CTCGCCGAAG CAGCGGGTCT 

BamHI 



CGGGCGTGAG TCGGCCCGGA TCCTCGCGGG 
GCCCGCACTC AGCCGGGCCT AGGAGCGCCC 

TTTGTGGTAG AATTTGAATC CCTCAGCATT 
AAACACCATC TTAAACTTAG GGAGTCGTAA 

CAGCCTCGTG CGGAGCTTTT TTGTAGGTAG 
GTCGGAGCAC GCCTCGAAAA AA CATC CATC 



Bss HII (4938) 



EcoW{\) 



:amp 





pJQ4 

4974 bp 



f1 on 



Actin promoter 

Bam HI (703) 
Bglll (735) 
ffindlll (856) 
PPI (APO) 




NcoI(938) 
M>fl(1007) 
5o/I(1107) 
iccl(1108) 
Bam HI (1163) 
SIG-VAC-FAE-F/S 
Xho I (1545) 
to I (1632) 
5/?/i I (1784) 
tfofI(1799) 
Bss HII (2039) 
5s j HII (2063) 
C/a I (2090) 
tfmdlll (2095) 
Xba\{2M) 
BssYiW (2150) 



Sequence for pJQ4 



EcoRI 



1 AATTCCACAA TGAACAATAA TAAGATTAAA ATAGCTTGCC CCCGTTGCAG 
CGATGGGTAT TTTTTCTAGT 

TTAAGGTGTT ACTTGTTATT ATTCTAATTT TATCGAACGG GGGCAACGTC 
GCTACCCATA AAAAAGATCA 

71 AAAATAAAAG ATAAACTTAG ACTCAAAACA TTTACAAAAA GAACCCCTAA 
AGTCCTAAAG CCCAAAGTGC 

TTTTATTTTC TATTTGAATC TGAGTTTTGT AAATGTTTTT GTTGGGGATT 
TCAGGATTTC GGGTTTCACG 

141 TATGCACGAT CCATAGCAAG CCCAGCCCAA CCCAACCCAA CCCAACCCAC 
CCCAGTGCAG CCAACTGGCA 

ATACGTGCTA GGTATCGTTC GGGTCGGGTT GGGTTGGGTT GGGTTGGGTG 
GGGTCACGTC GGTTGACCGT 

211 AATAGTCTCC ACCCCCGGCA CTATCACCGT GAGTTGTCCG CACCACCGCA 
CGTCTCGCAG CCAAAAAAAA 

TTATCAGAGG TGGGGGCCGT GATAGTGGCA CTCAACAGGC GTGGTGGCGT 
GCAGAGCGTC GGTTTTTTTT 

281 AAAAAGAAAG AAAAAAAAGA AAAAGAAAAA CAGCAGGTGG GTCCGGGTCG 
TGGGGGCCGG AAAAGCGAGG 

TTTTTCTTTC TTTTTTTTCT TTTTCTTTTT GTCGTCCACC CAGGCCCAGC 
ACCCCCGGCC TTTTCGCTCC 

351 AGGATCGCGA GCAGCGACGA GGCCCGGCCC TCCCTCCGCT TCCAAAGAAA 
CGCCCCCCAT CGCCACTATA 

TCCTAGCGGT CGTCGCTGCT CCGGGCCGGG AGGGAGG CGA AGGTTTCTTT 
GCGGGGGGTA GCGGTGATAT 

421 TACATACCCC CCCCTCTCCT CCCATCCCCC CAACCCTACC ACCACCACCA 
CCACCACCTC CTCCCCCCTC t 

ATGTATGGGG GGGGAGAGGA GGGTAGGGGG GTTGGGATGG TGGTGGTGGT 
GGTGGTGGAG GAGGGGGGAG 

491 GCTGCCGGAC GACGAGCTCC TCCCCCCTCC CCCTCCGCGG CCGCCGGTAA 
CCACCCCGCC CCTCTCCTCT 

CGACGGCCTG CTGCTCGAGG AGGGGGGAGG GGGAGGCGGC GGCGGCCATT 
GGTGGGGCGG GGAGAGGAGA 

561 TTCTTTCTCC GTTTTTTTTT TCGTCTCGGT CTCGATCTTT GGCCTTGGTA 
GTTTGGGTGG GCGAGAGCGG 

AAGAAAGAGG CAAAAAAAAA AGCAGAGCCA GAGCTAGAAA CCGGAACCAT 
CAAACCCACC CGCTCTCGCC 

631 CTTCGTCGCC CAGATCGGTG CGCGGGAGGG GCGGGATCTC GCGGCTGGCG 
TCTCCGGGCG TGAGTGGGCC 




GAAGCAGCGG GTCTAGCCAC GCGCCCTCCC CGCCCTAGAG CGCCGACGGC 
AGAGGCCCGC ACTCAGCCGG 



BamHI Bglll 

701 CGGATCCTCG CGGGGAATGG GGCTCTCGGA TGTAGATCTT CTTTCTTTCT 
TCTTTTTGTG GTAGAATTTG 

GCCTAGGAGC GCCCCTTACC CCGAGAGCCT AC AT C T AGAA GAAAGAAAGA 

AGAAAAACAC CATCTTAAAC 

771 AATCCCTCAG CATTGTTCAT CGGTAGTTTT TCTTTTCATG ATTTGTGACA 

AATGCAGCCT CGTGCGGAGC 

TTAGGGAGTC GTAACAAGTA GC CAT CAAAA AGAAAAGTAC TAAACACTGT 

TTACGTCGGA GCACGCCTCG 

Hindi I I 



841 TTTTTTGTAG GTAGAAGCTT ACMATGGMCG TGCACAAGGA GGTSAACTTC 
GTSGCCTACC TCCTGATCGT 

AAAAAACATC CATCTTCGAA TGKTACCKGC ACGTGTTCCT CCASTTGAAG 

CASCGGATGG AGGAC TAGCA 



Ncol 



911 SCTCGGCCTC CTCTTGCTCG TSTCCGCCAT GGAGCACGTG GACGCCAAGG 
CCTGCACCCK CGAGTGCGGC 

SGAGCCGGAG GAGAACGAGC ASAGGCGGTA CCTCGTGCAC CTGCGGTTCC 
GGACGTGGGM GCTCACGCCG 



Not I 



9 81 AACCTCGGCT TCGGCATCTG CCCGGCGGCC GCCTCCACGC AGGGCATCTC 
CGAAGACCTC TACAGCCGTT 

TTGGAGCCGA AGCCGTAGAC GGGCCGCCGG CGGAGGTGCG TCCCGTAGAG 
GCTTCTGGAG ATGTCGGCAA 



Sail 



AccI 



1051 TAGTCGAAAT GGCCACTATC TCCCAAGCTG CCTACGCCGA CCTGTGCAAC 
ATTCCGTCGA CTATTATCAA 

ATCAGCTTTA CCGGTGATAG AGGGTTCGAC GGATGCGGCT GGACACGTTG 
TAAGGCAGCT GATAATAGTT 

BamHI 

1121 GGGAGAGAAA ATTTACAATT CTCAAACTGA CATTAACGGA TGGATCCTCC 
GCGACGACAG CAGCAAAGAA 



CCCTCTCTTT TAAATGTTAA GAGTTTGACT GTAATTGCCT ACCTAGGAGG 
CGCTGCTGTC GTCGTTTCTT 



1191 ATAATCACCG TCTTCCGTGG CACTGGTAGT GATACGAATC TACAACTCGA 
T AC T AAC T AC ACCCTCACGC 

TATTAGTGGC AGAAGGCACC GTGACCATCA CTATGCTTAG ATGTTGAGCT 
ATGATTGATG TGGGAGTGCG 

12 61 CTTTCGACAC CCTACCACAA TGCAACGGTT GTGAAGTACA CGGTGGATAT 
TATATTGGAT GGGTCTCCGT 

GAAAGCTGTG GGATGGTGTT ACGTTGCCAA CACTTCATGT GCCACCTATA 
ATATAACCTA CCCAGAGGCA 

1331 CCAGGACCAA GTCGAGTCGC TTGTCAAACA GCAGGTTAGC CAGTATCCGG 
ACTACGCGCT GACCGTGACC 

GGTCCTGGTT CAGCTCAGCG AACAGTTTGT CGTCCAATCG GTCATAGGCC 
TGATGCGCGA CTGGCACTGG 

1401 GGCCACKCCC TCGGCGCCTC CCTGGCGGCA CTCACTGCCG CCCAGCTGTC 
TGCGACATAC GACAACATCC 

CCGGTGMGGG AGCCGCGGAG GGACCGCCGT GAGTGACGGC GGGTCGACAG 
ACGCTGTATG CTGTTGTAGG 

1471 GCCTGTACAC CTTCGGCGAA CCGCGCAGCG GCAATCAGGC CTTCGCGTCG 
TACATGAACG ATGCCTTCCA 

CGGACATGTG GAAGCCGCTT GGCGCGTCGC CGTTAGTCCG GAAGCGCAGC 
ATGTACTTGC TACGGAAGGT 



1541 AGCCTCGAGC CCAGATACGA CGCAGTATTT CCGGGTCACT CATGCCAACG 
ACGGCATCCC AAACCTGCCC 

TCGGAGCTCG GGTCTATGCT GCGTCATAAA GGCCCAGTGA GTACGGTTGC 
TGCCGTAGGG TTTGGACGGG 



1611 CCGGTGGAGC AGGGGTACGC CCATGGCGGT GTAGAGTACT GGAGCGTTGA 
TCCTTACAGC GCCCAGAACA 

GGCCACCTCG TCCCCATGCG GGTACCGCCA CATCTCATGA CCTCGCAACT 
AGGAATGTCG CGGGTCTTGT 

1681 CATTTGTCTG CACTGGGGAT GAAGTGCAGT GCTGTGAGGC CCAGGGCGGA 
CAGGGTGTGA ATAATGCGCA 

GTAAACAGAC GTGACCCCTA CTTCACGTCA CGACACTCCG GGTCCCGCCT 
GTCCCACACT TATTACGCGT 



1751 CACGACTTAT TTTGGGATGA CGAGCGGCGC ATGCACCTGG CCGGTCGCGG 
CCGCGGAAAC CACTGAAGGA 

GTGCTGAATA AAACCCTACT GCTCGCCGCG TACGTGGACC GGCCAGCGCC 
GGCGCCTTTG GTGACTTCCT 



Xhol 



Ncol 



SphI 



Not I 




1821 TGAGCTGTAA AGAAGCAGAT CGTTCAAACA TTTGGCAATA AAGTTTCTTA 
AGATTGAATC CTGTTGCCGG 

ACTCGACATT TCTTCGTCTA GCAAGTTTGT AAACCGTTAT TTCAAAGAAT 
TCTAACTTAG GACAACGGCC 

18 91 TCTTGCGATG ATTATCATAT AATTTCTGTT GAATTACGTT AAGCATGTAA 
TAATTAACAT GTAATGCATG 

AGAACGCTAC TAATAGTATA TTAAAGACAA CTTAATGCAA TTCGTACATT 

ATTAATTGTA CATTACGTAC 

1961 ACGTTATTTA TGAGATGGGT TTTTATGATT AGAGTCCCGC AATTATACAT 
TTAATACGCG ATAGAAAACA 

TGCAATAAAT ACTCTACCCA AAAATACTAA TCTCAGGGCG TTAATATGTA 
AATTATGCGC TATCTTTTGT 



Clal Hindlll 



2 031 AAATATAGCG CGCAAACTAG GATAAATTAT CGCGCGCGGT GTCATCTATG 
TTACTAGATC GATAAGCTTC 

TTTATATCGC GCGTTTGATC CTATTTAATA GCGCGCGCCA CAGTAGATAC 
AATGATCTAG CTATTCGAAG 

Xbal Bs 



2101 TAGAGCGGCC GGTGGAGCTC CAATTCGCCC TATAGTGAGT CGTATTACGC 
GCGCTCACTG GCCGTCGTTT 

ATCTCGCCGG CCACCTCGAG GTTAAGCGGG ATATCACTCA GCATAATGCG 
CGCGAGTGAC CGGCAGCAAA 

2171 TACAACGTCG TGACTGGGAA AACCCTGGCG TTACCCAACT TAATCGCCTT 
GCAGCACATC CCCCTTTCGC 

ATGTTGCAGC ACTGACCCTT TTGGGACCGC AATGGGTTGA ATTAGCGGAA 
CGTCGTGTAG GGGGAAAGCG 

2241 CAGCTGGCGT AATAGCGAAG AGGCCCGCAC CGATCGGCCT TCCCAACAGT 
TGCGCAGCCT GAATGGCGAA 

GTCGACCGCA TTATCGCTTC TCCGGGCGTG GCTAGCGGGA AGGGTTGTCA 
ACGCGTCGGA CTTACCGCTT 

2311 TGGGACGCGC CCTGTAGCGG CGCATTAAGC GCGGCGGGTG TGGTGGTTAC 
GCGCAGCGTG ACCGCTACAC 

ACCCTGCGCG GGACATCGCC GCGTAATTCG CGCCGGCCAC ACCACCAATG 
CGCGTCGCAC TGGCGATGTG 

23 81 TTGCCAGCGC CCTAGCGCCC GCTCCTTTCG CTTTCTTCCC TTCCTTTCTC 
GCCACGTTCG CCGGCTTTCC 

AACGGTCGCG GGATCGCGGG CGAGGAAAGC GAAAGAAGGG AAGGAAAGAG 
CGGTGCAAGC GGCCGAAAGG 



Xbal 



BssHII 



BSSHII 




2451 CCGTCAAGCT CTAAATCGGG GGCTCCCTTT AGGGTTCCGA TTTAGTGCTT 
TACGGCACCT CGACCCCAAA 

GG CAGTTCGA GATTTAGCCC CCGAGGGAAA TCCCAAGGCT AAATCACGAA 
ATGCCGTGGA GCTGGGGTTT 

2521 AAACTTGATT AGGGTGATGG TTCACGTAGT GGGCCATCGC CCTGATAGAC 
GGTTTTTCGC CCTTTGACGT 

TTTGAACTAA TCCCACTACC AAGTGCATCA CCCGGTAGCG GGACTATCTG 
CCAAAAAGCG GGAAACTGCA 

2591 TGGAGTCCAC GTTCTTTAAT AGTGGACTCT TGTTCCAAAC TGGAACAACA 
CTCAACCCTA TCTCGGTCTA 

ACCTCAGGTG CAAGAAATTA TCACCTGAGA ACAAGGTTTG ACCTTGTTGT 
GAGTTGGGAT AGAGC CAGAT 

2 661 TTCTTTTGAT TTATAAGGGA TTTTGCCGAT TTCGGCCTAT TGGTTAAAAA 
ATGAGCTGAT TTAACAAAAA 

AAGAAAACTA AATATTCCCT AAAACGGCTA AAGCCGGATA ACCAATTTTT 
TACTCGACTA AATTGTTTTT 

2731 TTTAACGCGA ATTTTAACAA AATATTAACG CTTACAATTT AGGTGGCACT 
TTTCGGGGAA ATGTGCGCGG 

AAATTGCGCT TAAAATTGTT TTATAATTGC GAATGTTAAA TCCACCGTGA 
AAAGCCCCTT TACACGCGCC 

28 01 AACCCCTATT TGTTTATTTT TCTAAATACA TTCAAATATG TATCCGCTCA 
TGAGACAATA ACCCTGATAA 

TTGGGGATAA ACAAATAAAA AGATTTATGT AAGTTTATAC ATAGGCGAGT 
ACTCTGTTAT TGGGACTATT 

28 71 ATGCTTCAAT AATATTGAAA AAGGAAGAGT ATGAGTATTC AACATTTCCG 
TGTCGCCCTT ATTCCCTTTT 

TACGAAGTTA TTATAACTTT TTCCTTCTCA TACTCATAAG TTGTAAAGGC 
ACAGCGGGAA TAAGGGAAAA 

2941 TTGCGGCATT TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA 
GTAAAAGATG CTGAAGATCA 

AACGCCGTAA AACGGAAGGA CAAAAACGAG TGGGTCTTTG CGACCACTTT 
CATTTTCTAC GACTTCTAGT 

3 011 GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC AGCGGTAAGA 
TCCTTGAGAG TTTTCGCCCC 

CAACCCACGT GCTCACCCAA TGTAGCTTGA CCTAGAGTTG TCGCCATTCT 
AGGAACTCTC AAAAGCGGGG 

3 081 GAAGAACGTT TTCCAATGAT GAGCACTTTT AAAGTTCTGC TATGTGGCGC 
GGTATTATCC CGTATTGACG 

CTTCTTGCAA AAGGTTACTA CTCGTGAAAA TTTCAAGACG ATACACCGCG 
CCATAATAGG GCATAACTGC 

3151 CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 
GTTGAGTACT CACCAGTCAC 




GGCCCGTTCT CGTTGAGCCA GCGGCGTATG TGATAAGAGT CTTACTGAAC 
CAACTCATGA GTGGTCAGTG 

3221 AGAAAAGCAT CTTACGGATG GCATGACAGT AAGAGAATTA TGCAGTGCTG 
CCATAACGAT GAGTGATAAC 

TCTTTTCGTA GAATGCCTAC CGTACTGTCA TTCTCTTAAT ACGTCACGAC 
GGTATTGGTA CTCACTATTG 

32 91 ACTGCGGCCA ACTTACTTCT GACAAGGATC GGAGGACCGA AGGAGCTAAC 
CGCTTTTTTG GACAACATGG 

TGACGCCGGT TGAATGAAGA CTGTTGCTAG CCTCCTGGCT TCCTCGATTG 
GCGAAAAAAC GTGTTGTACC 

3361 GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT GAATGAAGCC 
ATACCAAACG ACGAGCGTGA 

CCCTAGTACA TTGAGCGGAA CTAGCAACCC TTGGCCTCGA CTTACTTCGG 
TATGGTTTGC TGCTCGCACT 

3431 CACCACGATG CCTGTAGCAA TGGCAACAAC GTTGCGCAAA CTATTAACTG 
GCGAACTACT TACTCTAGCT 

GTGGTGCTAC GGACATCGTT ACCGTTGTTG CAACGCGTTT GATAATTGAC 
CGCTTGATGA ATGAGATCGA 

35 01 TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 
ACTTCTGCGC TCGGCCCTTC 

AGGGGCGTTG TTAATTATCT GACCTACCTC CGCCTATTTC AACGTCCTGG 
TGAAGACGCG AGCCGGGAAG 

3571 CGGCTGGCTG GTTTATTGCT GATAAATCTG GAGCCGGTGA GCGTGGGTCT 
CGCGGTATCA TTGCAGCACT 

GCCGACCGAC CAAATAACGA CTATTTAGAC CTCGGCCACT CGCACCCAGA 
GCGCCATAGT AACGTCGTGA 

3 641 GGGGCCAGAT GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA 
GTCAGGCAAC TATGGATGAA 

CCCCGGTCTA CCATTCGGGA GGGCATAGCA TCAATAGATG TGCTGCCCCT 
CAGTCCGTTG ATACCTACTT 

3 711 CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA AGCATTGGTA 
ACTGTCAGAC CAAGTTTACT 

GCTTTATCTG TCTAGCGACT CTATCCACGG AGTGACTAAT TCGTAACCAT 
TGACAGTCTG GTTCAAATGA 

3 781 CATATATACT TTAGATTGAT TTAAAACTTC ATTTTTAATT TAAAAGGATC 
TAGGTGAAGA TCCTTTTTGA 

GTATATATGA AATCTAACTA AATTTTGAAG TAAAAATTAA ATTTTCCTAG 
ATCCACTTCT AGGAAAAACT 



3 851 TAATCTCATG AC C AAAATCC CTTAACGTGA GTTTTCGTTC CACTGAGCGT 
CAGACCCCGT AGAAAAGATC 

ATTAGAGTAC TGGTTTTAGG GAATTGCACT CAAAAGCAAG GTGACTCGCA 
GTCTGGGGCA TCTTTTCTAG 



fv*>. 77 & 



3 921 AAAGGATCTT CTTGAGATCC TTTTTTTCTG CGCGTAATCT GCTGCTTGCA 
AACAAAAAAA CCACCGCTAC 

TTTCCTAGAA GAACTCTAGG AAAAAAAGAC GCGCATTAGA CGACGAACGT 
TTGTTTTTTT GGTGGCGATG 

3 991 CAGCGGTGGT TTGTTTGCCG GATCAAGAGC TACCAACTCT TTTTCCGAAG 
GTAACTGGCT TCAGCAGAGC 

GTCGCCACCA AACAAACGGC CTAGTTCTCG ATGGTTGAGA AAAAGGCTTC 
CATTGACCGA AGTCGTCTCG 

4 0 61 GCAGATACCA AATACTGTCC TTCTAGTGTA GCCGTAGTTA GGCCACCACT 
TCAAGAACTC TGTAGCACCG 

CGTCTATGGT TTATGACAGG AAGAT CAC AT CGGCATCAAT CCGGTGGTGA 
AGTTCTTGAG ACATCGTGGC 

4131 CCTACATACC TCGCTCTGCT AATCCTGTTA CCAGTGGCTG CTGCCAGTGG 
CGATAAGTCG TGTCTTACCG 

GGATGTATGG AGCGAGACGA TTAGGACAAT GGTCACCGAC GACGGTCACC 
GCTATTCAGC ACAGAATGGC 

42 01 GGTTGGACTC AAGAC GATAG TTACCGGATA AGGCGCAGCG GTCGGGCTGA 
ACGGGGGGTT CGTGCACACA 

CCAACCTGAG TTCTGCTATC AATGGCCTAT TCCGCGTCGC CAGCCCGACT 
TGCCCCCCAA GCACGTGTGT 

42 71 GCCCAGCTTG GAGCGAACGA CCTACACCGA ACTGAGATAC CTACAGCGTG 
AGCTATGAGA AAGCGCCACG 

CGGGTCGAAC CTCGCTTGCT GGATGTGGCT TGACTCTATG GATGTCGCAC 
TCGATACTCT TTCGCGGTGC 

4341 CTTCCCGAAG GGAGAAAGGC GGACAGGTAT CCGGTAAGCG GCAGGGTCGG 
AACAGGAGAG CGCACGAGGG 

GAAGGGCTTC CCTCTTTCCG CCTGTCCATA GGCCATTCGC CGTCCCAGCC 
TTGTCCTCTC GCGTGCTCCC 

4411 AGCTTCCAGG GGGAAACGCC TGGTATCTTT ATAGTCCTGT CGGGTTTCGC 
CACCTCTGAC TTGAGCGTCG 

TCGAAGGTCC CCCTTTGCGG AC C AT AGAAA TATCAGGACA GCCCAAAGCG 
GTGGAGACTG AACTCGCAGC 

4481 ATTTTTGTGA TGCTCGTCAG GGGGGCGGAG CCTATGGAAA AACGCCAGCA 
ACGCGGCCTT TTTACGGTTC 

TAAAAACACT ACGAGCAGTC CCCCCGCCTC GGATACCTTT TTGCGGTCGT 
TGCGCCGGAA AAATGCCAAG 

4551 CTGGCCTTTT GCTGGCCTTT TGCTCACATG TTCTTTCCTG CGTTATCCCC 
TGATTCTGTG GATAACCGTA 

GACCGGAAAA CGACCGGAAA ACGAGTGTAC AAGAAAGGAC GCAATAGGGG 
ACTAAGACAC CTATTGGCAT 

4621 TTACCGCCTT TGAGTGAGCT GATACCGCTC GCCGCAGCCG AACGACCGAG 
CGCAGCGAGT CAGTGAGCGA 

AATGGCGGAA ACTCACTCGA CTATGGCGAG CGGCGTCGGC TTGCTGGCTC 
GCGTCGCTCA GTCACTCGCT 



4691 GGAAGCGGAA GAGCGCCCAA TACGCAAACC GCCTCTCCCC GCGCGTTGGC 
CGATTCATTA ATGCAGCTGG 

CCTTCGCCTT CTCGCGGGTT ATGCGTTTGG CGGAGAGGGG CGCGCAACCG 
GCTAAGTAAT TACGTCGACC 

4761 CACGACAGGT TTCCCGACTG GAAAGCGGGC AGTGAGCGCA ACGCAATTAA 
TGTGAGTTAG CTCACTCATT 

GTGCTGTCCA AAGGGCTGAC CTTTCGCCCG TCACTCGCGT TGCGTTAATT 
ACACTCAATC GAGTGAGTAA 

4831 AGGCACCCCA GGCTTTACAC TTTATGCTTC CGGCTCGTAT GTTGTGTGGA 
ATTGTGAGCG GATAACAATT 

TCCGTGGGGT CCGAAATGTG AAATACGAAG GCCGAGCATA CAACACACCT 
TAACACTCGC CTATTGTTAA 



4 901 TCACACAGGA AACAGCTATG ACCATGATTA CGCCAAGCGC GCAATTAACC 
CTCACTAAAG GGAACAAAAG 

AGTGTGTCCT TTGTCGATAC TGGTACTAAT GCGGTTCGCG CGTTAATTGG 
GAGTGATTTC CCTTGTTTTC 

EcoR 

4971 CTGG 
GACC 



BssHII 




Bgl II (5045) 



Hindlll (2) 

Ncol(9) 



Bam HI (5013) 
Rice actin 



£coRI(4311) 

Xho I (3935) 
4>«T(3924) 
#HII (3883) 




pPQIO.1 

5164 bp 





Atol(733) 
Smal{14\) 
Pstl (749) 
Nos? 

5^HII (984) 
5jjHII(1008) 
Cla I (1035) 
J^in din (1040) 
Xbal (1045) 
Bss HII (1095) 
f1 ori 



amp 



Sequence for pPQIO.l 



Hindi I I Ncol 



1 AAGCTTACCA TGGTGAGCAA GGGCGAGGAG CTGTTCACCG GGGTGGTGCC 
CATCCTGGTC GAGCTGGACG 

TTCGAATGGT ACCACTCGTT CCCGCTCCTC GACAAGTGGC CCCACCACGG 
GTAGGACCAG CTCGACCTGC 

71 GCGACGTGAA CGGCCACAAG TTCAGCGTGT CCGGCGAGGG CGAGGGCGAT 
GCCACCTACG GCAAGCTGAC 

CGCTGCAGTT GCCGGTGTTC AAGTCGCACA GGCCGCTCCC GCTCCCGCTA 
CGGTGGATGC CGTTCGACTG 

141 CCTGAAGTTC ATCTGCACCA GCGGCAAGCT GCCCGTGCCC TGGCCCACCC 
TCGTGACCAC CTTCACCTAC 

GGACTTCAAG TAGACGTGGT GGCCGTTCGA CGGGCACGGG ACCGGGTGGG 
AGCACTGGTG GAAGTGGATG 

211 GGCGTGCAGT GCTTCAGCCG CTACCCCGAC CACATGAAGC AGCACGACTT 
CTTCAAGTCC GCCATGCCCG 

CCGCACGTCA CGAAGTCGGC GATGGGGCTG GTGTACTTCG TCGTGCTGAA 
GAAGTTCAGG CGGTACGGGC 

281 AAGGCTACGT CGAGGAGCGC ACCATCTTCT TCAAGGACGA CGGCAACTAC 
AAGACCCGCG CCGAGGTGAA 

TTCCGATGCA GGTCCTCGCG TGGTAGAAGA AGTTCCTGCT GCCGTTGATG 
TTCTGGGCGC GGCTCCACTT 

3 51 GTTCGAGGGC GACACCCTGG TGAACCGCAT CGAGCTGAAG GGCATCGACT 
TCAAGGAGGA CGGCAACATC 

CAAGCTCCCG CTGTGGGACC ACTTGGCGTA GCTCGACTTC CCGTAGCTGA 
AGTTCCTCCT GCCGTTGTAG 

421 CTGGGGCACA AGCTGGAGTA CAACTACAAC AGCCACAACG TCTATATCAT 
GGCCGACAAG CAGAAGAACG 

GACCCCGTGT TCGACCTCAT GTTGATGTTG TCGGTGTTGC AGATATAGTA 
CCGGCTGTTC GTCTTCTTGC 

491 GCATCAAGGT GAACTTCAAG ATCCGCCACA ACATCGAGGA CGGCAGCGTG 
CAGCTCGCCG ACCACTACCA 

CGTAGTTCCA CTTGAAGTTC TAGGCGGTGT TGTAGCTCCT GGCGTCGCAC 
GTCGAGCGGC TGGTGATGGT 

5 61 GCAGAACACC CCCATCGGCG ACGGCCCCGT GCTGCTGCCC GACAACCACT 
ACCTGAGCAC CCAGTCCGCC 

CGTCTTGTGG GGGTAGCCGC TGCCGGGGCA CGACGACGGG CTGTTGGTGA 
TGGACTCGTG GGTCAGGCGG 

631 CTGAGCAAAG ACCCCAACGA GAAGCGCGAT CACATGGTCC TGGTGGAGTT 
CGTGACCGCC GCCGGGATCA 




CGGCATTATC GCTTCTCCGG GCGTGGCTAG CGGGAAGGGT TGTCAACGCG 
TCGGACTTAC CGCTTACCCT 



1261 CGCGCCCTGT AGCGGCGCAT TAAGCGCGGC GGGTGTGGTG GTTACGCGCA 
GCGTGACCGC TACACTTGCC 

GCGCGGGACA TCGCCGCGTA ATTCGCGCCG CCCACACCAC CAATGCGCGT 
CGCACTGGCG ATGTGAACGG 

1331 AGCGCCCTAG CGGCCGCTCC TTTCGCTTTC TTCCCTTCCT TTCTCGCCAC 
GTTCGCCGGC TTTCCCCGTC 

TCGCGGGATC GCGGGCGAGG AAAGCGAAAG AAGGGAAGGA AAGAGCGGTG 
CAAGCGGCCG AAAGGGGCAG 

14 01 AAGCTCTAAA TCGGGGGCTC CCTTTAGGGT TCCGATTTAG TGCTTTACGG 
CACCTCGACC CCAAAAAACT 

TTCGAGATTT AGCCCCGGAG GGAAATCCCA AGGCTAAATC ACGAAATGCC 
GTGGAGCTGG GGTTTTTTGA 

14 71 TGATTAGGGT GATGGTTCAC GTAGTGGGCC ATCGCCCTGA TAGACGGTTT 
TTCGCCCTTT GACGTTGGAG 

ACTAATCCCA CTACCAAGTG CATCACCCGG TAGCGGGACT ATCTGCCAAA 
AAGCGGGAAA CTGCAACCTC 

1541 TCCACGTTCT TTAATAGTGG ACTCTTGTTC CAAAC TGGAA CAACACTCAA 
CCCTATCTCG GTCTATTCTT 

AGGTGCAAGA AATTATCACC TGAGAACAAG GTTTGACCTT GTTGTGAGTT 
GGGATAGAGC CAGATAAGAA 

1611 TTGATTTATA AGGGATTTTG CCGATTTCGG CCTATTGGTT AAAAAATGAG 
CTGATTTAAC AAAAATTTAA 

AACTAAATAT TCCCTAAAAC GGCTAAAGCC GGATAAC CAA TTTTTTACTC 
GACTAAATTG TTTTTAAATT 

1681 CGCGAATTTT AACAAAATAT TAACGCTTAC AATTTAGGTG GCACTTTTCG 
GGGAAATGTG CGCGGAACCC 

GCGCTTAAAA TTGTTTTATA ATTGCGAATG TTAAATCCAC CGTGAAAAGC 
CCCTTTACAC GCGCCTTGGG 

1751 CTATTTGTTT ATTTTTCTAA ATACATTCAA ATATGTATCC GCTCATGAGA 
CAATAACCCT GATAAATGCT 

GATAAACAAA TAAAAAGATT TATGTAAGTT TATACATAGG CGAGTACTCT 
GTTATTGGGA CTATTTACGA 

1821 TCAATAATAT TGAAAAAGGA AGAGTATGAG TATTCAACAT TTCCGTGTCG 
CCCTTATTCC CTTTTTTGCG 

AGTTATTATA ACTTTTTCCT TCTCATACTC ATAAGTTGTA AAGGCACAGC 
GGGAATAAGG GAAAAAACGC 

1891 GCATTTTGCC TTCCTGTTTT TGCTCACCCA GAAACGCTGG TGAAAGTAAA 
AGATGCTGAA GATCAGTTGG 

CGTAAAACGG AAGGACAAAA ACGAGTGGGT CTTTGCGACC ACTTTCATTT 
TCTACGACTT CTAGTCAACC 




1961 GTGCACGAGT GGGTTACATC GAACTGGATC TCAACAGCGG TAAGATCCTT 
GAGAGTTTTC GCCCCGAAGA 

CACGTGCTCA CCCAATGTAG CTTGACCTAG AGTTGTCGCC ATTCTAGGAA 
CTCTCAAAAG CGGGGCTTCT 

2 031 ACGTTTTCCA ATGATGAGCA CTTTTAAAGT TCTGCTATGT GGCGCGGTAT 
TATCCCGTAT TGACGCCGGG 

TGCAAAAGGT TACTACTCGT GAAAATTTCA AGACGATACA CCGCGCCATA 
ATAGGGCATA ACTGCGGCCC 

2101 CAAGAGCAAC TCGGTCGCCG CATACACTAT TCTCAGAATG ACTTGGTTGA 
GTACTCACCA GTCACAGAAA 

GTTCTCGTTG AGCCAGCGGC GTATGTGATA AGAGTCTTAC TGAACCAACT 
CATGAGTGGT CAGTGTCTTT 

2171 AGCATCTTAC GGATGGCATG ACAGTAAGAG AATTATGCAG TGCTGCCATA 
ACCATGAGTG ATAACACTGC 

TCGTAGAATG CCTACCGTAC TGTCATTCTC TTAATACGTC ACGACGGTAT 
TGGTACTCAC TATTGTGACG 

2241 GGCCAACTTA CTTCTGACAA CGATCGGAGG ACCGAAGGAG CTAACCGCTT 
TTTTGCACAA CATGGGGGAT 

CCGGTTGAAT GAAGACTGTT GCTAGCCTCC TGGCTTCCTC GATTGGCGAA 
AAAACGTGTT GTACCCCCTA 

2311 CATGTAACTC GCCTTGATCG TTGGGAACCG GAGCTGAATG AAGCCATACC 
AAACGACGAG CGTGACACCA 

GTACATTGAG CGGAACTAGC AACCCTTGGC CTCGACTTAC TTCGGTATGG 
TTTGCTGCTC GCACTGTGGT 

23 81 CGATGCCTGT AGCAATGGCA ACAACGTTGC GCAAACTATT AACTGGCGAA 
CTACTTACTC TAGCTTCCCG 

GCTACGGACA TCGTTACCGT TGTTGCAACG CGTTTGATAA TTGACCGCTT 
GATGAATGAG AT CGAAGGG C 

2451 GCAACAATTA ATAGAC TGGA TGGAGGCGGA TAAAGTTGCA GGACCACTTC 
TGCGCTCGGC CCTTCCGGCT 

CGTTGTTAAT TATCTGACCT ACCTCCGCCT ATTTCAACGT CCTGGTGAAG 
ACGCGAGCCG GGAAGGCCGA 

2521 GGCTGGTTTA TTGCTGATAA ATCTGGAGCC GGTGAGCGTG GGTCTCGCGG 
TATCATTGCA GCACTGGGGC 

CCGACCAAAT AACGACTATT TAGACCTCGG CCACTCGCAC CCAGAGCGCC 
ATAGTAACGT CGTGACCCCG 

2 591 CAGATGGTAA GCCCTCCCGT ATCGTAGTTA TCTACACGAC GGGGAGTCAG 
GCAACTATGG ATGAACGAAA 

GTCTAGCATT CGGGAGGGCA TAGCATCAAT AGATGTGCTG CCCCTCAGTC 
CGTTGATACC TACTTGCTTT 

2 6 61 TAGACAGATC GCTGAGATAG GTGCCTCACT GATTAAGCAT TGGTAACTGT 
CAGACCAAGT TTACTCATAT 

ATCTGTCTAG CGACTCTATC CACGGAGTGA CTAATTCGTA ACCATTGACA 
GTCTGGTTCA AATGAGTATA 




2731 ATACTTTAGA TTGATTTAAA ACTTCATTTT TAATTTAAAA GGATCTAGGT 
GAAGATCCTT TTTGATAATC 

TATGAAATCT AACTAAATTT TGAAGTAAAA ATTAAATTTT CCTAGATCCA 
CTTCTAGGAA AAACTATTAG 

2801 TCATGACCAA AATCCGTTAA CGTGAGTTTT CGTTCCACTG AGCGTCAGAC 
CCCGTAGAAA AGATCAAAGG 

AGTACTGGTT TTAGGGAATT GCACTCAAAA GCAAGGTGAC TCGCAGTCTG 
GGGCATCTTT TCTAGTTTCC 

2 871 ATCTTCTTGA GATCCTTTTT TTCTGCGCGT AATCTGCTGC TTGCAAACAA 
AAAAACCACC GCTACCAGCG 

TAGAAGAACT CTAGGAAAAA AAGACGCGCA TTAGACGACG AACGTTTGTT 
TTTTTGGTGG CGATGGTCGC 

2 941 GTGGTTTGTT TGCCGGATCA AGAGCTACCA ACTCTTTTTC CGAAGGTAAC 
TGGCTTCAGC AGAGCGCAGA 

CACCAAACAA ACGGCCTAGT TCTCGATGGT TGAGAAAAAG GCTTCCATTG 
ACCGAAGTCG TCTCGCGTCT 

3 011 TACCAAATAC TGTCCTTCTA GTGTAGCCGT AGTTAGGCCA CCACTTCAAG 
AACTCTGTAG CACCGCCTAC 

ATGGTTTATG ACAGGAAGAT CACATCGGCA TCAATCCGGT GGTGAAGTTC 
TTGAGACATC GTGGCGGATG 

3 0 81 ATACCTCGCT CTGCTAATCC TGTTACCAGT GGCTGCTGCC AGTGGCGATA 
AGTCGTGTCT TACCGGGTTG 

TATGGAGCGA GACGATTAGG ACAATGGTCA CCGACGACGG TCACCGCTAT 
TCAGCACAGA ATGGCCCAAC 

3151 GAGTCAAGAC GATAGTTACC GGATAAGGCG CAGCGGTCGG GCTGAACGGG 
GGGTTCGTGC ACACAGCCCA 

CTGAGTTCTG CTATCAATGG CCTATTCCGC GTCGCCAGCC CGACTTGCCC 
CCCAAGCACG TGTGTCGGGT 

3 221 GCTTGGAGCG AACGAGCTAC ACCGAACTGA GATACCTACA GCGTGAGCTA 
TGAGAAAGCG CCACGCTTCC 

CGAACCTCGC TTGCTGGATG TGGCTTGACT CTATGGATGT CGCACTCGAT 
ACTCTTTCGC GGTGCGAAGG 

3 2 91 CGAAGGGAGA AAGGCGGACA GGTATCCGGT AAGCGGCAGG GTCGGAACAG 
GAGAGCGCAC GAGGGAGCTT 

GCTTCCCTCT TTCCGCCTGT CCATAGGCCA TTCGCCGTCC CAGCCTTGTC 
CTCTCGCGTG CTCCCTCGAA 

33 61 CCAGGGGGAA ACGCCTGGTA TCTTTATAGT CCTGTCGGGT TTCGGCACCT 
CTGACTTGAG CGTCGATTTT 

GGTCCCCCTT TGGGGACCAT AGAAAT AT C A GGACAGCCCA AAGCGGTGGA 
GACTGAACTC GCAGCTAAAA 

3431 TGTGATGCTC GTCAGGGGGG CGGAGCCTAT GGAAAAACGC CAGCAACGCG 
GCCTTTTTAC GGTTCCTGGC 




ACACTACGAG CAGTCCCCCC GCCTCGGATA CCTTTTTGCG GTCGTTGCGC 
CGGAAAAATG CGAAGGACCG 



3 501 CTTTTGCTGG CCTTTTGCTC ACATGTTCTT TCCTGCGTTA TCCCCTGATT 
CTGTGGATAA CCGTATTACC 

GAAAACGACC GGAAAACGAG TGTACAAGAA AGGACGCAAT AGGGGACTAA 
GACACCTATT GGCATAATGG 

3 571 GCCTTTGAGT GAGCTGATAC CGCTCGCCGC AGCCGAACGA CCGAGCGCAG 
CGAGTCAGTG AGC GAGGAAG 

CGGAAACTCA CTCGACTATG GCGAGCGGCG TCGGCTTGCT GGCTCGCGTC 
GCTCAGTCAC TCGCTCCTTC 

3641 CGGAAGAGCG CCCAATACGC AAACCGCCTC TCCCCGCGCG TTGGCCGATT 
CATTAATGCA GCTGGCACGA 

GCCTTCTCGC GGGTTATGCG TTTGGCGGAG AGGGGCGCGC AACCGGCTAA 
GTAATTACGT CGACCGTGCT 

3 711 CAGGTTTCCC GACTGGAAAG CGGGCAGTGA GCGCAACGCA ATTAATGTGA 
GTTAGCTCAC TCATTAGGCA 

GTCCAAAGGG CTGACCTTTC GCCCGTCACT CGCGTTGCGT TAATTACACT 
CAATCGAGTG AGTAATCCGT 

3 781 CCCCAGGCTT TACACTTTAT GCTTCCGGCT CGTATGTTGT GTGGAATTGT 
GAGCGGATAA CAATTTCACA 

GGGGTCCGAA ATGTGAAATA CGAAGGCCGA GCATACAACA CACCTTAACA 
CTCGCCTATT GTTAAAGTGT 



3 851 CAGGAAACAG CTATGACCAT GATTACGCCA AGCGCGCAAT TAACCCTCAC 
TAAAGGGAAC AAAAGCTGGG 

GTCCTTTGTC GATAC TGGT A CTAATGCGGT TCGCGCGTTA ATTGGGAGTG 
ATTTCCCTTG TTTTCGACCC 



3 921 TACCGGGCCC CCCCTCGAGG TCATTCATAT GCTTGAGAAG AGAGTCGGGA 
TAGTCCAAAA TAAAACAAAG 

ATGGCCCGGG GGGGAGCTCC AGTAAGTATA CGAACTCTTC TCTCAGCCCT 
ATCAGGTTTT ATTTTGTTTC 

3 9 91 GTAAGATTAC CTGGTCAAAA GTGAAAACAT CAGTTAAAAG GTGGTATAAG 
TAAAATATCG GTAATAAAAG 

CATTCTAATG GACCAGTTTT CACTTTTGTA GTGAATTTTC CACCATATTC 
ATTTTATAGC CATTATTTTC 

4061 GTGGCCCAAA GTGAAATTTA CTCTTTTCTA C TAT TAT AAA AATTGAGGAT 
GTTTTGTCGG TACTTTGATA 

CACCGGGTTT CACTTTAAAT GAGAAAAGAT GATAATATTT TTAACTCCTA 
CAAAACAGGC ATGAAACTAT 



BssHII 



Kpnl 



Kpnl 



Xhol 




4131 CGTCATTTTT GTATGAATTG GTTTTTAAGT TTATTCGCGA TTTGGAAATG 
CATATCTGTA TTTGAGTCGG 

GCAGTAAAAA CATACTTAAC CAAAAATTCA AATAAGCGCT AAACCTTTAC 
GTATAGACAT AAACTCAGCC 

42 01 TTTTTAAGTT CGTTGCTTTT GTAAATACAG AGGGATTTGT ATAAGAAATA 
TCTTTAAAAA ACCCATATGC 

AAAAATTCAA GCAACGAAAA CATTTATGTC TCCCTAAACA TATTCTTTAT 
AGAAATTTTT TGGGTATACG 



4271 TAATTTGACA TAATTTTTGA GAAAAATATA TATTCAGGCG AATTCCACAA 
TGAACAATAA TAAGATTAAA 

ATTAAACTGT ATTAAAAACT CTTTTTATAT ATAAGTCCGC TTAAGGTGTT 
ACTTGTTATT ATTCTAATTT 

4341 ATAGCTTGCC CGCGTTGCAG CGATGGGTAT TTTTTCTAGT AAAATAAAAG 
ATAAACTTAG ACTCAAAACA 

TATCGAACGG GGGCAACGTC GCTACCCATA AAAAAGATCA TTTTATTTTC 
TATTTGAATC TGAGTTTTGT 

4411 TTTACAAAAA CAACCCCTAA AGTCCTAAAG CCCAAAGTGC TATGCACGAT 
C CAT AG C AAG GCCAGCCCAA 

AAATGTTTTT GTTGGGGATT TCAGGATTTC GGGTTTCACG ATACGTGCTA 
GGTATCGTTC GGGTCGGGTT 

44 81 CCCAACCCAA CCCAACCCAC CCCAGTGCAG CCAACTGGCA AATAGTCTCC 
ACCCCCGGCA CTATCACCGT 

GGGTTGGGTT GGGTTGGGTG GGGTCACGTC GGTTGAC CGT T TAT CAGAGG 
TGGGGGCCGT GATAGTGGCA 

4551 GAGTTGTCCG CACCACCGCA CGTCTCGCAG CCAAAAAAAA AAAAAGAAAG 
AAAAAAAAGA AAAAGAAAAA 

CTCAACAGGC GTGGTGGCGT GCAGAGCGTC GGTTTTTTTT TTTTTCTTTC 
TTTTTTTTCT TTTTCTTTTT 

4621 CAGCAGGTGG GTCCGGGTCG TGGGGGCCGG AAAAGCGAGG AGGATCGCGA 
GCAGCGACGA GGCCCGGCCC 

GTCGTCCACC CAGGCCCAGC ACCCCCGGCC TTTTCGCTCC TCCTAGCGCT 
CGTCGCTGCT CCGGGCCGGG 

46 91 TCCCTCCGCT T C C AAAG AAA CGCCCCCCAT CGCCACTATA TACATACCCC 
CCCCTCTCCT CCCATCCCCC 

AGGGAGGCGA AGGTTTCTTT GCGGGGGGTA GCGGTGATAT ATGTATGGGG 
GGGGAGAGGA GGGTAGGGGG 

4761 CAAGCCTACC ACCACCACCA CCACCACCTC CTCCCCCCTC GCTGCCGGAC 
GACGAGCTCC TCCCCCCTCC 

GTTGGGATGG TGGTGGTGGT GGTGGTGGAG GAGGGGGGAG CGACGGCCTG 
CTGCTCGAGG AGGGGGGAGG 

4831 CCCTCCGCCG CCGCCGGTAA CCACCCCGCC CCTCTCCTCT TTCTTTCTCC 
GTTTTTTTTT TCGTCTCGGT 



EcoRI 




G 



GGGAGGCGGC GGCGGCCATT GGTGGGGCGG GGAGAGGAGA AAGAAAGAGG 



CAAAAAAAAA AGCAGAGCCA 

49 01 CTCGATCTTT GGCCTTGGTA GTTTGGGTGG GCGAGAGCGG CTTCGTCGCC 
CAGATCGGTG CGCGGGAGGG 

GAGCTAGAAA CCGGAACCAT CAAACCCACC CGCTCTCGCC GAAGCAGCGG 
GTCTAGCGAC GCGCCCTCCC 



4 971 GCGGGATCTC GCGGCTGGCG TCTCCGGGCG TGAGTCGGCC CGGATCCTCG 
CGGGGAATGG GGCTCTCGGA 

CGCCCTAGAG CGCCGACCGC AGAGGCCCGC ACTCAGCCGG GCCTAGGAGC 
GCCCCTTACC CCGAGAGCCT 



5041 TGTAGATCTT CTTTCTTTCT TCTTTTTGTG GTAGAATTTG AATCCCTCAG 
GATTGTTCAT CGGTAGTTTT 

ACATCTAGAA GAAAGAAAGA AGAAAAAGAC CATCTTAAAC TTAGGGAGTC 

GTAACAAGTA GCCATCAAAA 

5111 TCTTTTCATG ATTTGTGACA AATGCAGCCT CGTGCGGAGC TTTTTTGTAG GTAG 



AGAAAAGTAC TAAACACTGT TTACGTCGGA GCACGCCTCG AAAAAACATC CATC 



BamHI 



Bglll 




BssRll (4929) 



Ecom{\) 




Bam HI (703) 



P 



11 



pJQ3 

4965 bp 




f1 ori 



Is? 




Sg/II (735) 
Hin dill (856) 
RST (GOLGI) 
JVofl (998) 
5a/I(1098) 
,4 cc I (1099) 
Bam HI (1154) 
SIG-VAC-FAE-F/S 
JftoI(1536) 
Nco I (1623) 
5pA I (1775) 
M>fl(1790) 
555 HII (2030) 
Bss HII (2054) 
C/a I (2081) 
if hi dill (2086) 
Xbal(2m) 
Bss HII (2141) 



Sequence for pJQ3 



EcoRI 



1 AATTCCACAA TGAACAATAA TAAGATTAAA ATAGCTTGCC CCCGTTGCAG 
CGATGGGTAT TTTTTCTAGT 

TTAAGGTGTT ACTTGTTATT ATTCTAATTT TATCGAACGG GGGCAACGTC 

GCTACCCATA AAAAAG AT C A 

71 AAAATAAAAG ATAAAC TTAG ACTCAAAACA TTTACAAAAA CAACCCCTAA 
AGTCCTAAAG CCCAAAGTGC 

TTTTATTTTC TATTTGAATC TGAGTTTTGT AAATGTTTTT GTTGGGGATT 

TCAGGATTTC GGGTTTCACG 

141 TATGCACGAT CCATAGCAAG CCCAGCCCAA CCCAACCCAA CCCAACCCAC 
CCCAGTGCAG CCAACTGGCA 

ATACGTGCTA GGTATCGTTC GGGTCGGGTT GGGTTGGGTT GGGTTGGGTG 
GGGTCACGTC GGTTGACCGT 

211 AATAGTCTCC ACCCCCGGCA CTATCACCGT GAGTTGTCCG CACCACCGCA 
CGTCTCGCAG CCAAAAAAAA 

TTATCAGAGG TGGGGGCCGT GATAGTGGCA CTCAACAGGC GTGGTGGCGT 
GCAGAGCGTC GGTTTTTTTT 

281 AAAAAGAAAG AAAAAAAAGA AAAAGAAAAA CAGCAGGTGG GTCCGGGTCG 
TGGGGGCGGG AAAAGCGAGG 

TTTTTCTTTC TTTTTTTTCT TTTTCTTTTT GTCGTCGACC CAGGCCCAGC 

ACCCCCGGCC TTTTCGCTCC 

351 AGGATCGCGA GCAGCGACGA GGCCCGGCCC TCCCTCCGCT TCCAAAGAAA 
CGCCCCCCAT CGCCACTATA 

TCCTAGCGCT CGTCGCTGCT CCGGGCCGGG AGGGAGGCGA AGGTTTCTTT 
GCGGGGGGTA GCGGTGATAT 

421 TACATACCCC CCCCTCTCCT CCCATCCCCC CAACCCTACC ACCACCACCA 
CCACCACCTC CTGCCCCCTC 

ATGTATGGGG GGGGAGAGGA GGGTAGGGGG GTTGGGATGG TGGTGGTGGT 
GGTGGTGGAG GAGGGGGGAG 

491 GCTGCCGGAC GACGAGCTCC TCCCCCCTCC CCCTCGGCCG CCGCCGGTAA 
CCACCCCGCC CCTCTCCTCT 

CGACGGCCTG CTGCTCGAGG AGGGGGGAGG GGGAGGCGGC GGCGGCCATT 
GGTGGGGCGG GGAGAGGAGA 

561 TTCTTTCTCC GTTTTTTTTT TCGTCTCGGT CTCGATCTTT GGCCTTGGTA 
GTTTGGGTGG GCGAGAGCGG 

AAGAAAGAGG CAAAAAAAAA AGCAGAGCCA GAGCTAGAAA CCGGAACCAT 
CAAAGCCACC CGCTCTCGCC 

631 CTTCGTCGCC CAGATCGGTG CGCGGGAGGG GCGGGATCTC GCGGCTGGCG 
TCTCCGGGCG TGAGTGGGCC 




GAAGCAGCGG GTCTAGCCAC GCGCCCTCCC CGCCCTAGAG CGCCGACCGC 
AGAGGCCCGC ACTCAGCCGG 



BamHI 



Bglll 



701 CGGATCCTCG CGGGGAATGG GGCTCTCGGA TGTAGATCTT CTTTCTTTCT 
TCTTTTTGTG GTAGAATTTG 

GCCTAGGAGC GCCCCTTACC CCGAGAGCCT ACATCTAGAA GAAAGAAAGA 
AGAAAAACAC CATCTTAAAC 

771 AATCCCTCAG CATTGTTCAT CGGTAGTTTT TCTTTTCATG ATTTGTGACA 
AATGCAGCCT CGTGCGGAGC 

TTAGGGAGTC GTAACAAGTA GCCATCAAAA AGAAAAGTAC TAAACACTGT 
TTACGTCGGA GCACGCCTCG 



841 TTTTTTGTAG GTAGAAGCTT ACCATGATCC ACACCAACCT CAAAAAGAAG 
TTCTCCCTCT TCATCCTCGT 

AAAAAACATC CATCTTCGAA TGGTACTAGG TGTGGTTGGA GTTTTTCTTC 
AAGAGGGAGA AGTAGGAGCA 

911 CTTCCTCCTC TTCGCCGTGA TCTGCGTGTG GAAGAAGGGC TCCGACTACG 
AGGCCCTCAC CCTCCAAGCC 

GAAGGAGGAG AAGCGGCACT AGACG C AC AC CTTCTTCCCG AGGCTGATGC 
TCCGGGAGTG GGAGGTTCGG 



981 AAGGAGTTCC AAATGGCGGC CGCCTCCACG CAGGGCATCT CCGAAGACCT 
CTACAGCCGT TTAGTCGAAA 

TTCCTCAAGG TTTACCGCCG GCGGAGGTGC GTCCCGTAGA GGCTTCTGGA 
GATGTCGGCA AATCAGCTTT 



1051 TGGCCACTAT CTCCCAAGCT GCCTACGCCG ACCTGTGCAA CATTCCGTCG 
ACTATTATCA AGGGAGAGAA 

ACCGGTGATA GAGGGTTCGA CGGATGCGGC TGGACACGTT GTAAGGCAGC 
TGATAATAGT TCCCTCTCTT 



1121 AATTTACAAT TCTCAAACTG ACATTAACGG ATGGATCCTC CGCGACGACA 
GCAGCAAAGA AATAATCACC 

TTAAATGTTA AGAGTTTGAC TGTAATTGCC TACCTAGGAG GCGCTGCTGT 
CGTCGTTTCT TTATTAGTGG 

1191 GTCTTCCGTG GCACTGGTAG TGATACGAAT CTACAACTCG ATACTAACTA 
CACCCTCACG CCTTTCGACA 

CAGAAGGCAC CGTGACCATC ACTATGCTTA GATGTTGAGC TATGATTGAT 
GTGGGAGTGC GGAAAGCTGT 



Hindlll 



Not I 



Sail 



AccI 



BamHI 




1261 CCCTACCACA ATGCAACGGT TGTGAAGTAC ACGGTGGATA TTATATTGGA 
TGGGTCTCCG TCCAGGACCA 

GGGATGGTGT TACGTTGCCA ACACTTCATG TGCCACCTAT AATATAACCT 
ACCCAGAGGC AGGTCCTGGT 

1331 AGTCGAGTCG CTTGTCAAAC AGCAGGTTAG CCAGTATCCG GACTACGCGC 
TGACCGTGAC CGGCCACKCC 

TCAGCTCAGC GAACAGTTTG TCGTCCAATC GGTCATAGGC CTGATGCGCG 
ACTGGCACTG GCCGGTGMGG 

1401 CTCGGCGCCT CCCTGGCGGC ACTCACTGCC GCCCAGCTGT CTGCGACATA 
CGACAACATC CGCCTGTACA 

GAGCCGCGGA GGGACCGCCG TGAGTGACGG CGGGTCGACA GACGCTGTAT 
GCTGTTGTAG GCGGACATGT 



1471 CCTTCGGCGA ACCGCGCAGC GGCAATCAGG CCTTCGCGTC GTACATGAAC 
GATGCCTTCC AAGCCTCGAG 

GGAAGCCGCT TGGCGCGTCG CCGTTAGTCC GGAAGCGCAG CATGTACTTG 
CTACGGAAGG TTCGGAGCTC 

1541 CCCAGATACG ACGCAGTATT TCCGGGTCAC TCATGCCAAC GACGGCATCC 
CAAACCTGCC CCCGGTGGAG 

GGGTCTATGC TGCGTCATAA AGGCCCAGTG AGTACGGTTG CTGCCGTAGG 
GTTTGGACGG GGGCCACCTC 



1611 CAGGGGTACG CCCATGGCGG TGTAGAGTAC TGGAGCGTTG ATCCTTACAG 
CGCCCAGAAC ACATTTGTCT 

GTCCCCATGC GGGTACCGCC ACATCTCATG ACCTCGCAAC TAGGAATGTC 
GCGGGTCTTG TGTAAACAGA 

1681 GCACTGGGGA TGAAGTGCAG TGCTGTGAGG CCCAGGGCGG ACAGGGTGTG 
AATAATGCGC ACACGACTTA 

CGTGACCCCT ACTTCACGTC ACGACACTCC GGGTCCCGCC TGTCCCACAC 
TTATTACGCG TGTGCTGAAT 



1751 TTTTGGGATG ACGAGCGGCG CATGCACCTG GCCGGTCGCG GCCGCGGAAA 
CCACTGAAGG ATGAGCTGTA 

AAAACCCTAG TGCTCGCCGC GTACGTGGAC CGGCGAGCGC CGGCGCCTTT 
GGTGACTTCC TACTGGACAT 

1821 AAGAAGCAGA TCGTTCAAAC ATTTGGCAAT AAAGTTTCTT AAGATTGAAT 
CCTGTTGCCG GTCTTGCGAT 

TTCTTCGTCT AGCAAGTTTG TAAACCGTTA TTTCAAAGAA TTCTAACTTA 
GGACAACGGC CAGAACGCTA 



Xhol 



Ncol 



SphI 



Not I 




1891 GATTATGATA TAATTTCTGT TGAATTACGT TAAGCATGTA ATAATTAACA 
TGTAATGCAT GACGTTATTT 

CTAATAGTAT ATTAAAGACA ACTTAATGCA ATTCGTACAT TATTAATTGT 
ACATTACGTA CTGCAATAAA 



19 61 ATGAGATGGG TTTTTATGAT TAGAGTCCCG CAATTATACA TTTAATACGC 
GATAGAAAAC AAAATATAGC 

TACTCTACCC AAAAATACTA ATCTCAGGGC GTTAATATGT AAATTATGCG 
CTATCTTTTG TTTTATATCG 



2 031 GCGCAAACTA GGATAAATTA TCGCGCGCGG TGTCATCTAT GTTACTAGAT 
CGATAAGCTT CTAGAGCGGC 

CGCGTTTGAT CCTATTTAAT AGCGCGCGCC ACAGTAGATA CAATGATCTA 
GCTATTCGAA GATCTCGCCG 



2101 CGGTGGAGCT CCAATTCGCC CTATAGTGAG TCGTATTACG CGCGCTCACT 
GGCCGTCGTT TTACAACGTC 

GCCACGTCGA GGTTAAGCGG GATATCACTC AGCATAATGC GCGCGAGTGA 
CCGGCAGCAA AATGTTGCAG 

2171 GTGACTGGGA AAACCCTGGC GTTACCCAAC TTAATCGCCT TGCAGCACAT 
CCCCCTTTCG CCAGCTGGCG 

CACTGACCCT TTTGGGACCG CAATGGGTTG AATTAGCGGA ACGTCGTGTA 
GGGGGAAAGC GGTCGACCGG 

2241 TAATAGCGAA GAGGCCCGCA CCGATCGCCC TTCCCAACAG TTGCGCAGCC 
TGAATGGCGA ATGGGACGCG 

ATTATCGCTT CTCCGGGCGT GGCTAGCGGG AAGGGTTGTC AACGCGTCGG 
ACTTACCGCT TACCCTGCGC 

2311 CCCTGTAGCG GCGCATTAAG CGCGGCGGGT GTGGTGGTTA CGCGCAGCGT 
GACCGCTACA CTTGCCAGCG 

GGGACATCGC CGCGTAATTC GCGCCGCCCA CACCACCAAT GCGCGTCGCA 
CTGGCGATGT GAACGGTCGC 

23 81 CCCTAGCGCC CGCTCCTTTC GCTTTCTTCC CTTCCTTTCT CGCCACGTTC 
GCCGGCTTTC CCCGTCAAGC 

GGGATCGCGG GCGAGGAAAG CGAAAGAAGG GAAGGAAAGA GCGGTGCAAG 
CGGCCGAAAG GGGCAGTTCG 



BssHII 



Xbal 



BssHII 



BssHII 



CI 



Hindi II 



BssHII 




24 51 TCTAAATCGG GGGCTCCCTT TAGGGTTCCG ATTTAGTGCT TTACGGCACC 
TCGACCCCAA AAAACTTGAT 

AGATTTAGCC CCCGAGGGAA ATCCCAAGGC TAAATCACGA AATGCCGTGG 
AGCTGGGGTT TTTTGAACTA 

2521 TAGGGTGATG GTTCACGTAG TGGGCCATCG CCCTGATAGA CGGTTTTTCG 
CCCTTTGACG TTGGAGTCCA 

ATCCCACTAC CAAGTGCATC ACCCGGTAGC GGGACTAT C T GCCAAAAAGC 
GGGAAACTGC AACCTCAGGT 

2591 CGTTCTTTAA TAGTGGACTC TTGTTCCAAA CTGGAACAAC ACTCAACCCT 
ATCTCGGTCT ATTCTTTTGA 

GCAAGAAATT ATCACCTGAG AACAAGGTTT GACCTTGTTG TGAGTTGGGA 
TAGAGCCAGA TAAGAAAACT 

2 661 TTTATAAGGG ATTTTGCCGA TTTCGGCCTA TTGGTTAAAA AATGAGCTGA 
TTTAACAAAA ATTTAACGCG 

AAATATTCCC TAAAACGGCT AAAGGCGGAT AACCAATTTT TTACTCGACT 
AAATTGTTTT TAAATTGCGC 

2731 AATTTTAACA AAATATTAAC GCTTACAATT TAGGTGGCAC TTTTCGGGGA 
AATGTGCGCG GAACCCCTAT 

TTAAAATTGT TTTATAATTG CGAATGTTAA ATCCACCGTG AAAAGCCCCT 
TTACACGCGC CTTGGGGATA 

28 01 TTGTTTATTT TTCTAAATAC ATTCAAATAT GTATCCGCTC ATGAGACAAT 
AACCCTGATA AATGCTTCAA 

AACAAATAAA AAGATTTATG TAAGTTTATA CATAGGCGAG TACTCTGTTA 
TTGGGACTAT TTACGAAGTT 

2 871 TAATATTGAA AAAGGAAGAG TATGAGTATT CAACATTTCC GTGTCGCCCT 
TATTCCCTTT TTTGCGGCAT 

ATTATAACTT TTTCCTTCTC ATACTCATAA GTTGTAAAGG CACAGCGGGA 
ATAAGGGAAA AAACGCCGTA 

2 941 TTTGCCTTCC TGTTTTTGCT CACC CAGAAA CGCTGGTGAA AGTAAAAGAT 
GCTGAAGATC AGTTGGGTGC 

AAACGGAAGG ACAAAAACGA GTGGGTCTTT GCGACCACTT TCATTTTCTA 
CGACTTCTAG TCAACCCACG 

3 011 ACGAGTGGGT TACATCGAAC TGGATCTCAA CAGGGGTAAG ATCCTTGAGA 
GTTTTCGCCC CGAAGAACGT 

TGCTCACCCA ATGTAGCTTG ACCTAGAGTT GTCGCCATTC TAGGAACTCT 
CAAAAGCGGG GCTTCTTGCA 

3 081 TTTCCAATGA TGAGCACTTT TAAAGTTCTG CTATGTGGCG CGGTATTATC 
CCGTATTGAC GCCGGGCAAG 

AAAGGTTACT ACTCGTGAAA ATTT CAAGAC GATACACCGC GC CATAATAG 
GGCATAACTG CGGCCCGTTC 

3151 AGCAACTCGG TCGCCGCATA CACTATTCTC AGAATGACTT GGTTGAGTAC 
TCACCAGTCA CAGAAAAGCA 

TCGTTGAGCC AGCGGCGTAT GTGATAAGAG TCTTACTGAA CCAACTCATG 
AGTGGTCAGT GTCTTTTCGT 




3221 TCTTACGGAT GGCATGACAG TAAGAGAATT ATGCAGTGCT GCCATAACCA 
TGAGTGATAA CACTGCGGCC 

AGAATGCCTA CCGTACTGTC ATTCTCTTAA TACGTCACGA CGGTATTGGT 
ACTCACTATT GTGACGCCGG 

3 291 AACTTACTTC TGACAACGAT CGGAGGACCG AAGGAGC TAA CCGCTTTTTT 
GCACAACATG GGGGATCATG 

TTGAATGAAG ACTGTTGCTA GCCTCCTGGC TTCCTCGATT GGCGAAAAAA 
CGTGTTGTAC CCCCTAGTAC 

3 361 TAACTCGCCT TGATCGTTGG GAACCGGAGC TGAATGAAGC CATACCAAAC 
GACGAGCGTG ACACCACGAT 

ATTGAGCGGA ACTAGCAACC CTTGGCCTCG ACTTACTTCG GTATGGTTTG 
CTGCTCGCAC TGTGGTGCTA 

3431 GCCTGTAGCA ATGGCAACAA CGTTGCGCAA ACTATTAACT GGCGAACTAC 
TTACTGTAGC TTCCCGGCAA 

CGGACATCGT TACCGTTGTT GCAACGCGTT TGATAATTGA CCGCTTGATG 
AATGAGATCG AAGGGCCGTT 

3 501 CAATTAATAG ACTGGATGGA GGCGGATAAA GTTGCAGGAC CACTTCTGCG 
CTCGGCCCTT CCGGCTGGCT 

GTTAATTATC TGACCTACCT CCGCCTATTT CAACGTCCTG GTGAAGAGGC 
GAGCCGGGAA GGCCGACCGA 

3571 GGTTTATTGC TGATAAATCT GGAGCCGGTG AGCGTGGGTC TCGCGGTATC 
ATTGCAGCAC TGGGGCCAGA 

CCAAATAACG AC TATTTAGA CCTCGGCCAC TCGCACCCAG AGCGCCATAG 
TAACGTCGTG ACCCCGGTCT 

3 641 TGGTAAGCCC TCCCGTATCG TAGTTATCTA CACGACGGGG AGTCAGGCAA 
CTATGGATGA ACGAAATAGA 

ACCATTCGGG AGGGCATAGC ATCAATAGAT GTGCTGCCCC TCAGTCCGTT 
GATACCTACT TGCTTTATCT 

3 711 CAGATCGCTG AGATAGGTGC CTCACTGATT AAGCATTGGT AACTGTCAGA 
CCAAGTTTAC TCATATATAC 

GTCTAGCGAC TCTATCCACG GAGTGACTAA TTCGTAACCA TTGACAGTCT 
GGTTCAAATG AGTATATATG 

3 781 TTTAGATTGA TTTAAAACTT CATTTTTAAT TTAAAAGGAT CTAGGTGAAG 
ATCCTTTTTG ATAATCTCAT 

AAATCTAACT AAATTTTGAA GTAAAAATTA AATTTTCCTA GATCCACTTC 
TAGGAAAAAC TATTAGAGTA 

3 851 GACCAAAATC CCTTAACGTG AGTTTTCGTT CCACTGAGCG TCAGACCCCG 
TAGAAAAGAT CAAAGGATCT 

CTGGTTTTAG GGAATTGCAC TCAAAAGCAA GGTGACTCGC AGTCTGGGGC 
ATCTTTTCTA GTTTCCTAGA 

3 9 21 TCTTGAGATC CTTTTTTTCT GCGCGTAATC TGCTGCTTGC AAACAAAAAA 
ACCACCGCTA CCAGCGGTGG 




AGAACTCTAG GAAAAAAAGA CGCGCATTAG ACGACGAACG TTTGTTTTTT 
TGGTGGCGAT GGTCGCCACC 



3 991 TTTGTTTGCC GGATCAAGAG CTACCAACTC TTTTTCCGAA GGTAACTGGC 
TTCAGCAGAG CGCAGATACC 

AAACAAACGG CCTAGTTCTC GATGGTTGAG AAAAAGGCTT CCATTGAGCG 
AAGTCGTCTC GCGTCTATGG 

40 61 AAATACTGTC CTTCTAGTGT AGCCGTAGTT AGGCCACCAC TTCAAGAACT 
CTGTAGCACC GCCTACATAC 

TTTATGACAG GAAGAT CACA TCGGCATCAA TCCGGTGGTG AAGTTCTTGA 

GACATCGTGG CGGATGTATG 

4131 CTCGCTCTGC TAATGCTGTT ACCAGTGGCT GCTGCCAGTG GCGATAAGTC 
GTGTCTTACC GGGTTGGACT 

GAGCGAGACG ATTAGGACAA TGGTCACCGA CGACGGTCAC CGCTATTCAG 

CACAGAATGG CCCAACCTGA 

42 01 CAAGACGATA GTTACCGGAT AAGGCGCAGC GGTCGGGCTG AACGGGGGGT 
TCGTGCACAC AGCCCAGCTT 

GTTCTGCTAT CAATGGCCTA TTCCGCGTCG CCAGCCCGAC TTGCCCCCCA 

AGCACGTGTG TCGGGTCGAA 

42 71 GGAGCGAACG ACCTACACCG AACTGAGATA CCTACAGCGT GAGCTATGAG 
AAAGCGCCAC GCTTCCCGAA 

CCTGGCTTGC TGGATGTGGC TTGACTCTAT GGATGTCGCA CTCGATACTC 
TTTCGCGGTG CGAAGGGCTT 

4341 GGGAGAAAGG CGGACAGGTA TCCGGTAAGC GGCAGGGTCG GAACAGGAGA 
GCGCACGAGG GAGCTTCCAG 

CCCTCTTTCC GCCTGTCCAT AGGCCATTCG CCGTCCCAGC CTTGTCCTCT 
CGCGTGCTCC CTCGAAGGTC 

4411 GGGGAAACGC CTGGTATCTT TATAGTCCTG TCGGGTTTCG CCACCTCTGA 
CTTGAGCGTC GATTTTTGTG 

CCCGTTTGCG GACCATAGAA AT AT CAGGAC AGCCCAAAGC GGTGGAGACT 
GAACTCGCAG CTAAAAACAC 

4481 ATGCTCGTCA GGGGGGCGGA GCCTATGGAA AAACGCCAGC AACGCGGCCT 
TTTTACGGTT CCTGGCCTTT 

TACGAGCAGT CCCCCCGCCT CGGATACCTT TTTGCGGTCG TTGCGCCGGA 
AAAATGCCAA GGACCGGAAA 

4551 TGCTGGCCTT TTGCTCACAT GTTCTTTCCT GCGTTATCCC CTGATTCTGT 
GGATAACCGT ATTACCGCCT 

ACGACCGGAA AACGAGTGTA CAAGAAAGGA CGCAATAGGG GACTAAGACA 
CCTATTGGCA TAATGGCGGA 

4621 TTGAGTGAGC TGATACCGCT CGCCGCAGCC GAACGACCGA GCGCAGCGAG 
TCAGTGAGCG AGGAAGCGGA 

AACTCACTCG ACTATGGCGA GCGGCGTCGG CTTGCTGGCT CGCGTCGCTC 
AGTCACTCGC TCCTTCGCCT 




46 91 AGAGCGCCCA ATACGCAAAC CGCCTCTCCC CGCGCGTTGG CCGATTCATT 
AATGCAGCTG GCACGACAGG 

TCTCGGGGGT TATGCGTTTG GCGGAGAGGG GCGCGCAACC GGCTAAGTAA 
TTACGTCGAC CGTGCTGTCC 

47 61 TTTCCCGACT GGAAAGCGGG CAGTGAGCGC AACGCAATTA ATGTGAGTTA 
GCTCACTCAT TAGGCACCCC 

AAAGGGCTGA CCTTTCGCCC GTCACTCGCG TTGCGTTAAT TACACTCAAT 
CGAGTGAGTA ATCCGTGGGG 

4831 AGGCTTTACA CTTTATGCTT CCGGCTCGTA TGTTGTGTGG AATTGTGAGC 
GGATAACAAT TTCACACAGG 

TCCGAAATGT GAAATACGAA GGCCGAGCAT ACAACACACC TTAACACTCG 
CCTATTGTTA AAGTGTGTCC 

BSSHII 

EcoRI 

4 901 AAACAGC TAT GAC CATGATT ACGCCAAGCG CGCAATTAAC CCTCACTAAA 

GGGAACAAAA GCTGG 

TTTGTCGATA CTGGTACTAA TGCGGTTCGC GCGTTAATTG GGAGTGATTT 

CCCTTGTTTT CGACC 



Fry He X 



Figure A- 




Figure Hlh 



NCOl 



Hilldll I 



M AHA RVL LLAL AVL ATA AVAV 
1 AAGCTTACCA TGGCCCACGC CCGCGTCCTC CTCCTGGCGC TCGCCGTGCT GGCCACGGCC GCCGTCGCCG 

Not I 



• ASS R A A ASTQ GIS EDL YSRL V E M 
71 TCGCCTCCTC CCGCGCGGCC GCCTCCACGC AGGGCATCTC CGAAGACCTC TACAGCCGTT TAGTCGAAAT 

Sail 



•ATI S Q A A YAD LCN IPST I I K GEK 
141 GGCCACTATC TCCCAAGCTG CCTACGCCGA CCTGTGCAAC ATTCCGTCGA CTATTATCAA GGGAGAGAAA 

BamHI 



IYNS QTD ING WILR DDS S K E I I T V 
211 ATTTACAATT CTCAAACTGA CATTAACGGA TGGATCCTCC GCGACGACAG CAGCAAAGAA ATAATCACCG 
FRG TGS DTNL QLD TNY TLTP FDT 

2 81 TCTTCCGTGG CACTGGTAGT GATACGAATC TACAACTCGA TACTAACTAC ACCCTCACGC CTTTCGACAC 

•LPQ CNGC EVH GGY YIGW VSV QDQ 

3 51 CCTACCACAA TGCAACGGTT GTGAAGTACA CGGTGGATAT TATATTGGAT GGGTCTCCGT CCAGGACCAA 

VESL VKQ QVS QYPD YAL TVT GHXL 
421 GTCGAGTCGC TTGTCAAACA GCAGGTTAGC CAGTATCCGG ACTACGCGCT GACCGTGACC GGCCACKCCC 
• GAS L A A LTAA QLS ATY DNIR LYT 

4 91 TCGGCGCCTC CCTGGCGGCA CTCACTGCCG CCCAGCTGTC TGCGACATAC GACAACATCC GCCTGTACAC 

Xhol 

■FGE PRSG NQA FAS YMND AFQ ASS 
561 CTTCGGCGAA CCGCGCAGCG GCAATCAGGC CTTCGCGTCG TACATGAACG ATGCCTTCCA AGCCTCGAGC 

PDTT QYF RVT HAND GIP NLP PVEQ 
631 CCAGATACGA CGCAGTATTT CCGGGTCACT CATGCCAACG ACGGCATCCC AAACCTGCCC CCGGTGGAGC 
Ncol 



• GYA HGG VEYW SVD PYS AQNT FVC 
7 01 AGGGGTACGC GCATGGCGGT GTAGAGTACT GGAGCGTTGA TCCTTACAGC GCCCAGAACA CATTTGTCTG 

* T G D EVQC CEA QGG QGVN NAH TTY 
771 CACTGGGGAT GAAGTGCAGT GCTGTGAGGC CCAGGGCGGA CAGGGTGTGA ATAATGCGCA CACGACTTAT 

SphI NotI 



FGMT SGA CTW PVAA AEP LKD EL* 
841 TTTGGGATGA CGAGCGGCGC ATGCACCTGG CCGGTCGCGG CCGCGGAACC ACTGAAGGAT GAGCTGTAAA 
911 GAAGCAGATC GTTCAAACAT TTGGCAATAA AGTTTCTTAA GATTGAATCC TGTTGCCGGT CTTGCGATGA 
981 TTATCATATA ATTTCTGTTG AATTACGTTA AGCATGTAAT AATTAACATG TAATGCATGA CGTTATTTAT 
1051 GAGATGGGTT TTTATGATTA GAGTCCCGCA AT TAT AC ATT TAATACGCGA TAGAAAACAA AATATAGCGC 

Hindll I 



Clal Xbal 



1121 GCAAACTAGG ATAAATTATC GCGCGCGGTG TCATCTATGT TACTAGATCG ATAAGCTTCT AGAGCGGCCG 

1191 GTGGAGCTCC AATTCGCCCT ATAGTGAGTC GTATTACGCG CGCTCACTGG CCGTCGTTTT ACAACGTCGT 

1261 GACTGGGAAA ACCCTGGCGT TACCCAACTT AATCGCCTTG CAGCACATCC CCCTTTCGCC AGCTGGCGTA 

1331 ATAGCGAAGA GGCCCGCACC GATCGCCCTT CCGAACAGTT GCGCAGCCTG AATGGCGAAT GGGACGCGCC 

14 01 CTGTAGCGGC GCATTAAGCG CGGCGGGTGT GGTGGTTACG CGCAGCGTGA CCGCTACACT TGCCAGCGCC 

14 71 CTAGCGCCCG CTCCTTTCGC TTTCTTCCCT TCCTTTCTCG CCACGTTCGC CGGCTTTCCC CGTCAAGCTC 

1541 TAAATCGGGG GCTCCCTTTA GGGTTCCGAT TTAGTGCTTT ACGGCACCTC GACCCCAAAA AACTTGATTA 

1611 GGGTGATGGT TCACGTAGTG GGCCATCGCC CTGATAGACG GTTTTTCGCC CTTTGACGTT GGAGTCCACG 

16 81 TTCTTTAATA GTGGACTCTT GTTCCAAACT GGAACAACAC TCAACCCTAT CTCGGTCTAT TCTTTTGATT 

1751 TATAAGGGAT TTTGCCGATT TCGGCCTATT GGTTAAAAAA TGAGCTGATT TAACAAAAAT TTAACGCGAA 



1821 TTTTAACAAA ATATTAACGC TTACAATTTA GGTGGCACTT TTCGGGGAAA TGTGCGCGGA ACCCCTATTT 

1891 GTTTATTTTT CTAAATACAT TCAAATATGT ATCCGCTCAT GAGACAATAA CCCTGATAAA TGCTTCAATA 

1961 ATATTGAAAA AGGAAGAGTA TGAGTATTCA ACATTTCCGT GTCGCCCTTA TTCCCTTTTT TGCGGCATTT 

2031 TGCCTTCCTG TTTTTGCTCA CCCAGAAACG CTGGTGAAAG TAAAAGATGC TGAAGATCAG TTGGGTGCAC 

2101 GAGTGGGTTA CATCGAACTG GATCTCAACA GCGGTAAGAT CCTTGAGAGT TTTCGCCCCG AAGAACGTTT 

2171 TCCAATGATG AGCACTTTTA AAGTTCTGCT ATGTGGCGCG GTATTATCCC GTATTGACGC CGGGCAAGAG 

2241 CAACTCGGTC GCCGCATACA CTATTCTCAG AATGACTTGG TTGAGTACTC ACCAGTCACA GAAAAGCATC 

2311 TTACGGATGG CATGACAGTA AGAGAATTAT GCAGTGCTGC CATAACCATG AGTGATAACA CTGCGGCCAA 

23 81 CTTACTTCTG ACAACGATCG GAGGACCGAA GGAGCTAACC GCTTTTTTGC ACAACATGGG GGATCATGTA 

24 51 ACTCGCCTTG ATCGTTGGGA ACCGGAGCTG AATGAAGCCA TACCAAACGA CGAGCGTGAC ACCACGATGC 
2521 CTGTAGCAAT GGCAACAACG TTGCGCAAAC TATTAACTGG CGAACTACTT ACTCTAGCTT CCCGGCAACA 

25 91 AT T AAT AG AC TGGATGGAGG CGGATAAAGT TGCAGGACCA CTTCTGCGCT CGGCCCTTCC GGCTGGCTGG 
2 6 61 TTTATTGCTG ATAAATCTGG AGCCGGTGAG CGTGGGTCTC GCGGTATCAT TGCAGCACTG GGGCCAGATG 
2 731 GTAAGCCCTC CCGTATCGTA GTTATCTACA CGACGGGGAG TGAGGCAACT ATGGATGAAC GAAATAGACA 
28 01 GATCGCTGAG ATAGGTGCCT CACTGATTAA GCATTGGTAA CTGTCAGACC AAGTTTACTC AT AT AT ACT T 

2 871 TAGATTGATT TAAAACTTCA TTTTTAATTT AAAAGG AT C T AGGT GAAGAT CCTTTTTGAT AATCTCATGA 
2941 CCAAAATCCC TTAACGTGAG TTTTCGTTCC ACTGAGCGTC AGACCCCGTA GAAAAGATCA AAGGATCTTC 

3 011 TTGAGATCCT TTTTTTCTGC GCGTAATCTG CTGCTTGCAA ACAAAAAAAC CACCGCTACC AGCGGTGGTT 
3 0 81 TGTTTGCCGG ATCAAGAGCT ACCAACTCTT TTTCCGAAGG TAACTGGCTT CAGCAGAGCG CAGATACCAA 
3151 ATACTGTCCT TCTAGTGTAG CCGTAGTTAG GCCACCACTT CAAGAACTCT GTAGCACCGC CTACATACCT 
3221 CGCTCTGCTA ATCCTGTTAC CAGTGGCTGC TGCCAGTGGC GATAAGTCGT GTCTTACCGG GTTGGACTCA 

32 91 AGACGATAGT TACCGGATAA GGCGCAGCGG TCGGGCTGAA CGGGGGGTTC GTGCACACAG CCCAGCTTGG 

33 61 AGCGAACGAC CTACACCGAA CTGAGATACC TACAGCGTGA GCTATGAGAA AGCGCCACGC TTCCCGAAGG 
3431 GAGAAAGGCG GACAGGTATC CGGTAAGCGG CAGGGTCGGA ACAGGAGAGC GCACGAGGGA GCTTCCAGGG 
3501 GGAAACGCCT GGTATCTTTA TAGTCCTGTC GGGTTTCGCC ACCTCTGACT TGAGCGTCGA TTTTTGTGAT 
3 5 71 GCTCGTCAGG GGGGCGGAGC CTATGGAAAA ACGCCAGCAA CGCGGCCTTT TTACGGTTCC TGGCCTTTTG 
3 641 CTGGCCTTTT GCTCACATGT TCTTTCCTGC GTTATCCCCT GATTCTGTGG ATAACCGTAT TACCGCCTTT 
3 711 GAGTGAGCTG ATACCGCTCG CCGCAGCCGA ACGACCGAGC GCAGCGAGTC AGTGAGCGAG GAAGCGGAAG 
3 781 AGCGCCCAAT ACGCAAACCG CCTCTCCCCG CGCGTTGGCC GATTCATTAA TGCAGCTGGC ACGACAGGTT 
3 851 TCCCGACTGG AAAGCGGGCA GTGAGCGCAA CGCAATTAAT GTGAGTTAGC TCACTCATTA GGCACCCCAG 
3 921 GCTTTACACT TTATGCTTCC GGCTCGTATG TTGTGTGGAA TTGTGAGCGG ATAACAATTT CACACAGGAA 

Kpnl 

3 991 ACAGCTATGA CCATGATTAC GCCAAGCGCG CAATTAACCC TCACTAAAGG GAACAAAAGC TGGGTACCGG 
Xhol 



4 061 GCCCCCCCTC GAGGTCATTC ATATGCTTGA GAAGAGAGTC GGGATAGTCC AAAATAAAAC AAAGGTAAGA 
4131 TTACCTGGTC AAAAGTGAAA ACATCAGTTA AAAGGTGGTA TAAGTAAAAT AT C GGTAAT A AAAGGTGGCC 
42 01 CAAAGTGAAA TTTACTCTTT TCTACTATTA TAAAAATTGA GGATGTTTTG TCGGTACTTT GATACGTCAT 
4271 TTTTGTATGA ATTGGTTTTT AAGTTTATTC GCGATTTGGA AATGCATATC TGTATTTGAG TCGGTTTTTA 
4 341 AGTTCGTTGC TTTTGTAAAT ACAGAGGGAT TTGTATAAGA AATATCTTTA AAAAACCCAT ATGCTAATTT 

EcoRI 



4411 GACATAATTT TTGAGAAAAA TATATATTCA GGCGAATTCC ACAATGAACA ATAATAAGAT TAAAATAGCT 

44 81 TGCCCCCGTT GCAGCGATGG GTATTTTTTC TAGTAAAATA AAAGATAAAC TTAGACTCAA AAC AT TTACA 

4551 AAAACAACCC CTAAAGTCCT AAAGCCCAAA GTGCTATGCA CGATCCATAG CAAGCCCAGC CCAACCCAAC 

4 621 CCAACCCAAC CCACCCCAGT GCAGCCAACT GGCAAATAGT CTCCACCCCC GGCACTATCA CCGTGAGTTG 

4 6 91 TCCGCACCAC CGCACGTCTC GCAGCCAAAA AAAAAAAAAG AAAGAAAAAA AAGAAAAAGA AAAACAGCAG 

4 761 GTGGGTCCGG GTCGTGGGGG CCGGAAAAGC GAGGAGGATC GCGAGCAGCG ACGAGGCCCG GCCCTCCCTC 

4 831 CGCTTCCAAA GAAACGCCCC CCATCGCCAC TATATACATA CCCCCCCCTC TCCTCCCATC CCCCCAACCC 

4 901 TACCACCACC ACCACCACCA CCTCCTCCCC CCTCGCTGCC GGACGACGAG CTCCTCCCCC CTCCCCCTCC 

4 971 GCCGCCGCCG GTAACCACCC CGCCCCTCTC CTCTTTCTTT CTCCGTTTTT TTTTTCGTCT CGGTCTCGAT 

5 041 CTTTGGCCTT GGTAGTTTGG GTGGGCGAGA GCGGCTTCGT CGCCCAGATC GGTGCGCGGG AGGGGCGGGA 

BamHI Bglll 



5111 TCTCGCGGCT GGCGTCTCCG GGCGTGAGTC GGCCCGGATC CTCGCGGGGA ATGGGGCTCT CGGATGTAGA 
Bglll 

5181 TCTTCTTTCT TTCTTCTTTT TGTGGTAGAA TTTGAATCCC TCAGCATTGT TCATCGGTAG TTTTTCTTTT 

5251 CATGATTTGT GACAAATGCA GCCTCGTGCG GAGCTTTTTT GTAGC 



flssHII (4961) 



Kpnl(l) 
Nco I (1) 
Pstl(9\) 




Xbal(U3) 

Xba\{\2\) 
Sma I (367) 
SEE promoter 



f1 ori 



Sph I (523) 
Pstl(m) 
SEEsignal+NPIR 
\ 5ssHII(976) 
. aleurain signal/NPIR 
Notl(m) 
Sa/I(1081) 
icd(1082) 
Sam HI (1137) 
Glyeosylation 
Intron 1 
FAE 

Xhol(\5\9) 
Nco I (1606) 
Redundancy (stop) 
KDEL 



^ A- 



nos 

BssHII (2062) 
BssHll (2086) 
C/o I (2113) 
ffiudlll (2118) 
Xbal(2\23) 
BssEll (2173) 



Sequence for pUB8,ll 



NCOl 



Kpnl 



1 CATGGGCCAG GTATAATTAT GGGATATCTC AAGCAAATAA TCGAAATATC 
ACCATTGGCT ACAATATCTG 

GTACCCGGTC CATATTAATA CCCTATAGAG TTCGTTTATT AGCTTTATAG 
TGGTAACCGA TGTTATAGAC 



71 AGCTCCGAGT TCTGACTGCA GTCTGGATGA CGCGTGTTGT ATCTAGAACT 
CTAGATAGCA CAGCCACAGC 

TCGAGGCTCA AGACTGACGT CAGACCTACT GCGCACAACA TAGATCTTGA 
GATCTATCGT GTCGGTGTCG 

141 ACCTACAGGA GTGCGACACT TGTGGACTGT AGTAGTGTTG GAGACGGAGC 
TCTTTCCTAC CTCCTGACGT 

TGGATGTCCT CACGCTGTGA AC AG C TGAC A TCATCACAAC CTCTGCCTCG 
AGAAAGGATG GAGGACTGCA 

211 TGCCGCCGTT GTCCATTCCA ACGGCATCAC TCTCAACCAA TCACGCGCTC 
CCAACAAAAT ATCGTCCCCC 

ACGGCGGCAA CAGGTAAGGT TGCCGTAGTG AGAGTTGGTT AGTGCGCGAG 
GGTTGTTTTA TAGCAGGGGG 

281 ATGTCTTGGC GGAGAGAGAG TACATACATG CTGTCGCGCC GTTTTTGTCT 
GAATCTCGCT TCCACTGGCC 

TACAGAACCG CCTCTCTCTC ATGTATGTAC GACAGCGCGG CAAAAACAGA 

CTTAGAGCGA AGGTGACCGG 



3 51 AATCAGCTCA GCTCCCGGGA GCTCACTCAT TCAAGATCCC ATCGTCGTCG 
TCACCCCTGG CGTCATGGGA 

TTAGTCGAGT CGAGGGCCCT CGAGTGAGTA AGTTCTAGGG TAGCAGCAGC 
AGTGGGGACC GCAGTACCCT 

421 TGGAAAAGAA CCTCCGTTGC TCGGATGAGT CAGCCATATC CCCGAACAGA 
GTACTGCAAG ATAACCCAAT 

ACCTTTTGTT GGAGGGAACG AGCCTACTCA GTCGGTATAG GGGCTTGTCT 
CATGACGTTC TATTGGGTTA 



4 91 TCAGATTCCC CCAATAGAGA AAGTATAGCA TGCTTTCGGG TTTTGTTTGG 
CTTAATTGAC TTTATTTTTG 

AGTCTAAGGG GGTTATCTCT TTCATATCGT ACGAAAGCCC AAAACAAACC 
GAATTAACTG AAATAAAAAC 



PstI 



Xbal 



Xbal 



Smal 



SphI 




561 TTGGAGTTGA ATGCTGATTT GTTGTGTAAA ATGCCCAACC ATCTGAATAT 
CGAGACGGAT AATAGGCTGG 

AACCTCAACT TACGACTAAA CAACACATTT TACGGGTTGG TAGACTTATA 
GCTCTGCCTA TTATCCGACC 

631 CTAATTAATT TATAGCAAGA TTCTGTAGTG CACATCGCAA ATATCTTTCT 
GGGCATTACA GCTGGAGGCT 

GATTAATTAA ATATCGTTCT AAGACATCAC GTGTAGCGTT TATAGAAAGA 
CCCGTAATGT GGACCTCCGA 



PstI 



7 01 TCATCAGCCT GAAACACTCT GCAGAGCCTG AAGCAAGTGG TGAAGCGTGG 
CGATGAGATG GGTATAAAAC 

AGTAGTCGGA CTTTGTGAGA CGTCTCGGAC TTCGTTCACC ACTTCGCACC 
GCTACTCTAC CCATATTTTG 

7 71 CCCCGGCACC GGGACGCGAG CTCCCGCCTA CCAGTACCAT CTCGCCTCGC 
TCCCCCTGCC GGACGACCCA 

GGGGCCGTGG CCCTGCGCTC GAGGGCGGAT GGTCATGGTA GAGCGGAGCG 

AGGGGGACGG CCTGCTGGGT 

841 GTAAAATACT GTTGCCCACT CGCCGGCGAG ATGGCCCACG GCCGCATCCT 
CTTCTTGGGG CTCGCCGTCT 

CATTTTATGA CAACGGGTGA GCGGCCGCTC TACCGGGTGC CGGCGTAGGA 
GAAGAACCGC GAGCGGCAGA 



BssHII 



Not I 



911 TGGCCACCGC CGCGGTGGCC GCCGCATCNT TGGCGGACTC CAACCCGATC 
CGGCCCGTCA CCGAGCGCGC 

ACCGGTGGCG GCGCCACCGG CGGCGTAGNA ACCGCCTGAG GTTGGGCTAG 

GCCGGGCAGT GGCTCGCGCG 



Not I 



981 GGCCGCCTCC ACGCAGGGCA TCTCCGAAGA CCTCTACAGC CGTTTAGTCG 
AAATGGCCAC TATCTCCCAA 

CCGGCGGAGG TGCGTCCCGT AGAGGCTTCT GGAGATGTCG GCAAATCAGC 

TTTACCGGTG ATAGAGGGTT 



Sail 



AccI 



1051 GCTGCCTACG CCGACCTGTG CAACATTCCG TCGACTATTA TCAAGGGAGA 
GAAAATTTAC AATTCTCAAA 



CGACGGATGC GGCTGGACAC GTTGTAAGGC AGCTGATAAT AGTTCCCTCT 



CTTTTAAATG TTAAGAGTTT 



BamHI 



1121 CTGACATTAA CGGATGGATC CTCCGCGACG ACAGCAGCAA AGAAATAATC 
ACCGTCTTCC GTGGCACTGG 

GACTGTAATT GCCTACCTAG GAGGCGCTGC TGTCGTCGTT TCTTTATTAG 
TGGCAGAAGG CACCGTGACC 

1191 TAGTGATACG AATCTACAAC TCGATACTAA CTACACCCTC ACGCCTTTCG 
ACACCCTACC ACAATGCAAC 

ATCAGTATGC TTAGATGTTG AGCTATGATT GATGTGGGAG TGCGGAAAGC 
TGTGGGATGG TGTTACGTTG 

12 61 GGTTGTGAAG TACACGGTGG ATATTATATT GGATGGGTCT CCGTCCAGGA 
CCAAGTCGAG TCGCTTGTCA 

CCAACACTTC ATGTGCCACC TATAATATAA CCTACCCAGA GGCAGGTCCT 

GGTTCAGCTC AGCGAACAGT 

1331 AACAGCAGGT TAGCCAGTAT GCGGACTACG CGCTGACCGT GACCGGCCAC 
KCCCTCGGCG CCTCCCTGGC 

TTGTCGTCCA ATCGGTCATA GGCCTGATGC GCGACTGGCA CTGGCCGGTG 

MGGGAGCCGC GGAGGGACCG 

14 01 GGCACTCACT GCCGCCCAGC TGTCTGCGAC ATACGACAAC ATCCGCCTGT 
ACACCTTCGG CGAACCGCGC 

CCGTGAGTGA CGGCGGGTCG ACAGACGCTG TATGCTGTTG TAGGCGGACA 

TGTGGAAGCC GCTTGGCGCG 



1471 AGCGGCAATC AGGCCTTCGC GTCGTACATG AACGATGCCT TCCAAGCCTC 
GAGCCCAGAT ACGACGCAGT 

TCGCCGTTAG TCCGGAAGCG CAGCATGTAC TTGCTACGGA AGGTTCGGAG 
CTCGGGTCTA TGCTGCGTCA 



1541 ATTTGCGGGT CACTCATGCC AACGACGGCA TCCCAAACCT GCCCCCGGTG 
GAGCAGGGGT ACGCCCATGG 

TAAAGGCCCA GTGAGTACGG TTGCTGCCGT AGGGTTTGGA CGGGGGCCAC 
CTCGTCCCCA TGCGGGTACC 

1611 CGGTGTAGAG TACTGGAGCG TTGATCCTTA CAGCGCCCAG AACACATTTG 
TCTGCACTGG GGATGAAGTG 

GCCACATCTC ATGACCTCGC AACTAGGAAT GTCGCGGGTC TTGTGTAAAC 
AGACGTGACC CCTACTTCAC 

1681 CAGTGCTGTG AGGCCCAGGG CGGACAGGGT GTGAATAATG CGCACACGAC 
TTATTTTGGG ATGACGAGCG 



Xhol 



Ncol 




GTCACGACAC TCCGGGTCCC GCCTGTCCCA CACTTATTAC GCGTGTGCTG 
AATAAAACCC TACTGCTCGC 

1751 GAGCCTGTAC ATGGTGATCA GTCATTTCAG CCTCCCCGAG TGTACCAGGA 

AAGATGGATG TCCTGGAGAG 

CTCGGACATG TACCACTAGT CAGTAAAGTC GGAGGGGCTC ACATGGTCCT 

TTCTACCTAC AGGACCTCTC 

1821 GGGGCCGCGT AACCACTGAA GGATGAGCTG TAAAGAAGCA GATCGTTCAA 

ACATTTGGCA ATAAAGTTTC 

CCCCGGCGCA TTGGTGACTT CCTACTCGAC ATTTCTTCGT CTAGCAAGTT 

TGTAAACCGT TATTTCAAAG 

1891 TTAAGATTGA ATCCTGTTGC CGGTCTTGCG ATGATTATCA TATAATTTCT 

GTTGAATTAC GTTAAGCATG 

AATTCTAACT TAGGACAACG GCCAGAACGC TACTAATAGT ATATTAAAGA 

CAACTTAATG CAATTCGTAC 

1961 TAATAATTAA CATGTAATGC ATGACGTTAT TTATGAGATG GGTTTTTATG 

ATTAGAGTCC CGCAATTATA 

ATTATTAATT GTACATTACG TACTGCAATA AATACTCTAC CCAAAAATAC 

TAATCTCAGG GCGTTAATAT 

BSSHII 

BssHII 



2 031 CATTTAATAC GCGATAGAAA ACAAAATATA GCGCGCAAAC TAGGATAAAT 
TATCGCGCGC GGTGTCATCT 

GTAAATTATG CGCTATCTTT TGTTTTATAT CGCGCGTTTG ATCCTATTTA 
ATAGCGCGCG CCACAGTAGA 



Xbal 

Clal Hindi I I 



2101 ATGTTACTAG ATCGATAAGC TTCTAGAGCG GCCGGTGGAG CTCCAATTCG 
CCCTATAGTG AGTCGTATTA 

TACAATGATC TAGCTATTCG AAGATCTCGC CGGCCACCTC GAGGTTAAGC 
GGGAT AT C AC TCAGCATAAT 

BSSHII 

2171 CGCGCGCTCA CTGGCCGTCG TTTTACAACG TCGTGACTGG GAAAACCCTG 
GCGTTACCCA ACTTAATCGC 

GCGCGCGAGT GACCGGCAGC AAAATGTTGC AGCACTGACC CTTTTGGGAC 

CGCAATGGGT TGAATTAGCG 

2241 CTTGCAGCAC ATCCCCCTTT CGCCAGCTGG CGTAATAGCG AAGAGGCCCG 
CACCGATCGC CCTTCCCAAC 

GAACGTCGTG TAGGGGGAAA GCGGTCGACC GCATTATCGC TTCTCCGGGC 

GTGGCTAGCG GGAAGGGTTG 



2311 AGTTGCGCAG CCTGAATGGC GAATGGGACG CGCCCTGTAG CGGCGCATTA 
AGCGCGGCGG GTGTGGTGGT 

TCAACGCGTC GGACTTACCG CTTACCCTGC GCGGGACATC GCCGCGTAAT 
TCGCGCCGCC CACACCACCA 

23 81 TACGCGCAGC GTGACCGCTA CACTTGCCAG CGCCCTAGCG CCCGCTCCTT 
TCGCTTTCTT CCCTTCCTTT 

ATGCGCGTCG CACTGGCGAT GTGAACGGTC GCGGGATCGC GGGCGAGGAA 
AGCGAAAGAA GGGAAGGAAA 

2451 CTCGCCACGT TCGCCGGCTT TCCCCGTCAA GCTCTAAATC GGGGGCTCCC 
TTTAGGGTTC CGATTTAGTG 

GAGCGGTGCA AGCGGCCGAA AGGGGCAGTT CGAGATTTAG CCCCCGAGGG 
AAATGCCAAG GCTAAATCAC 

2 521 CTTTACGGCA CCTCGACCCC AAAAAACTTG ATTAGGGTGA TGGTTCACGT 
AGTGGGCCAT CGCCCTGATA 

GAAATGCCGT GGAGCTGGGG TTTTTTGAAC TAATCCCACT ACCAAGTGCA 
TCACCCGGTA GCGGGACTAT 

2591 GACGGTTTTT CGCCCTTTGA CGTTGGAGTC CACGTTCTTT AATAGTGGAC 
TCTTGTTCCA AACTGGAACA 

CTGCCAAAAA GCGGGAAACT GCAACCTCAG GTGCAAGAAA TTATCACCTG 
AGAACAAGGT TTGACCTTGT 

2 6 61 ACACTCAACC CTATCTCGGT CTATTCTTTT GATTTATAAG GGATTTTGCC 
GATTTCGGCC TATTGGTTAA 

TGTGAGTTGG GATAGAGCCA GATAAGAAAA CTAAATATTC CCTAAAACGG 
CTAAAGCCGG ATAACCAATT 

2731 AAAATGAGCT GATTTAAGAA AAATTTAACG CGAATTTTAA CAAAATATTA 
ACGCTTACAA TTTAGGTGGC 

TTTTACTCGA CTAAATTGTT TTTAAATTGC GCTTAAAATT GTTTTATAAT 
TGCGAATGTT AAATCCACCG 

2801 ACTTTTCGGG GAAATGTGCG CGGAACCCCT ATTTGTTTAT TTTTCTAAAT 
ACATTCAAAT ATGTATCCGC 

TGAAAAGCCC CTTTACACGC GCCTTGGGGA TAAACAAATA AAAAGATTTA 
TGTAAGTTTA TACATAGGCG 

2 8 71 TCATGAGACA ATAACCCTGA TAAATGCTTC AATAATATTG AAAAAGGAAG 
AGTATGAGTA TTCAACATTT 

AGTACTCTGT TATTGGGACT ATTTACGAAG TTATTATAAC TTTTTCCTTC 
TCATACTCAT AAGTTGTAAA 

2 941 CCGTGTCGCC CTTATTCCCT TTTTTGCGGC ATTTTGCCTT CCTGTTTTTG 
CTCACCCAGA AACGCTGGTG 

GGCACAGCGG GAATAAGGGA AAAAACGCCG TAAAACGGAA GGACAAAAAC 
GAGTGGGTCT TTGCGACCAC 

3 011 AAAGTAAAAG ATGCTGAAGA TCAGTTGGGT GCACGAGTGG GTTACATCGA 
ACTGGATCTC AACAGCGGTA 

TTTCATTTTC TACGACTTCT AGTCAACCCA CGTGCTCACC CAATGTAGCT 
TGACCTAGAG TTGTCGCCAT 




3 081 AGATCCTTGA GAGTTTTCGC CCCGAAGAAC GTTTTCCAAT GATGAGCACT 

TTTAAAGTTC TGCTATGTGG 

TCTAGGAACT CTCAAAAGCG GGGCTTCTTG CAAAAGGTTA CTACTCGTGA 

AAATTTCAAG ACGATACACC 

3151 CGCGGTATTA TCCCGTATTG ACGCCGGGCA AGAGCAACTG GGTCGCCGCA 
TACACTATTC TCAGAATGAC 

GCGCCATAAT AGGGCATAAC TGCGGCCCGT TCTCGTTGAG CCAGCGGCGT 

ATGTGATAAG AGTCTTACTG 

3221 TTGGTTGAGT ACTCACCAGT CACAGAAAAG CATCTTACGG ATGGCATGAC 
AGTAAGAGAA TTATGCAGTG 

AACCAACTCA TGAGTGGTCA GTGTCTTTTC GTAGAATGCC TACCGTACTG 
TCATTCTCTT AATACGTCAC 

32 91 CTGCCATAAC CATGAGTGAT AACACTGCGG CCAACTTACT TCTGACAACG 
AT CGGAGGAC CGAAGGAGCT 

GACGGTATTG GTACTCACTA TTGTGACGCC GGTTGAATGA AGACTGTTGC 
TAGCCTCCTG GCTTCCTCGA 

3361 AACCGCTTTT TTGCACAACA TGGGGGATGA TGTAACTCGC CTTGATCGTT 
GGGAACCGGA GCTGAATGAA 

TTGGCGAAAA AACGTGTTGT ACCCCCTAGT ACATTGAGCG GAAC TAGCAA 
CCCTTGGCCT CGACTTACTT 

3431 GCCATACCAA ACGACGAGCG TGACACCACG ATGCCTGTAG CAATGGCAAC 
AACGTTGCGC AAAC TATTAA 

CGGTATGGTT TGCTGCTCGC ACTGTGGTGC TACGGACATC GTTACCGTTG 

TTGCAACGCG TTTGATAATT 

3 501 CTGGCGAACT ACTTACTCTA GCTTCCCGGC AACAATTAAT AGACTGGATG 
GAGGCGGATA AAGTTGCAGG 

GACCGCTTGA TGAATGAGAT CGAAGGGCCG TTGTTAATTA TCTGACCTAC 
CTCCGCCTAT TTCAACGTCC 

3 5 71 ACCACTTCTG CGCTCGGCCC TTCCGGCTGG CTGGTTTATT GCTGATAAAT 
CTGGAGCCGG TGAGCGTGGG 

TGGTGAAGAC GCGAGCCGGG AAGGCCGACC GACCAAATAA CGACTATTTA 
GACCTCGGCC ACTCGCACCC 

3 641 TCTCGCGGTA TCATTGCAGC ACTGGGGCCA GATGGTAAGC CCTCCCGTAT 
CGTAGTTATC TACACGACGG 

AGAGCGCCAT AGTAACGTCG TGACCCCGGT CTACCATTCG GGAGGGCATA 
GCATCAATAG ATGTGGTGCC 

3 711 GGAGTCAGGC AACTATGGAT GAAC GAAAT A GACAGATCGC TGAGATAGGT 
GCCTCACTGA TTAAGCATTG 

CCTCAGTCCG TTGATACCTA CTTGCTTTAT CTGTCTAGCG ACTCTATCCA 
CGGAGTGACT AATTCGTAAC 

3 781 GTAACTGTCA GACCAAGTTT ACTCATATAT ACTTTAGATT GATTTAAAAC 
TTCATTTTTA ATTTAAAAGG 




CATTGACAGT CTGGTTCAAA TGAGTATATA TGAAAT CTAA CTAAATTTTG 
AAGTAAAAAT TAAATTTTCC 



3 8 51 ATCTAGGTGA AGATCCTTTT TGATAATCTC ATGACCAAAA TCCCTTAACG 
TGAGTTTTCG TTCCACTGAG 

TAGATCCACT TCTAGGAAAA ACTATTAGAG TACTGGTTTT AGGGAATTGC 

ACTCAAAAGC AAGGTGACTC 

3 921 CGTCAGACCC CGTAGAAAAG ATCAAAGGAT CTTCTTGAGA TCCTTTTTTT 
CTGCGCGTAA TCTGCTGCTT 

GCAGTGTGGG GCATCTTTTC TAGTTTCCTA GAAGAACTCT AGGAAAAAAA 

GACGCGCATT AGACGACGAA 

3 991 GCAAACAAAA AAACCACCGC TACCAGCGGT GGTTTGTTTG CCGGATCAAG 
AGCTACCAAC TCTTTTTCCG 

CGTTTGTTTT TTTGGTGGCG ATGGTCGCCA CCAAACAAAC GGCCTAGTTC 

TCGATGGTTG AGAAAAAGGC 

4061 AAGGTAACTG GCTTCAGCAG AGCGCAGATA CCAAATACTG TCCTTCTAGT 
GTAGCCGTAG TTAGGCCACC 

TTCCATTGAC CGAAGTCGTC TCGCGTCTAT GGTTTATGAC AGGAAGATCA 

CATCGGCATC AATCCGGTGG 

4131 ACTTCAAGAA CTCTGTAGCA CCGCCTACAT ACCTCGCTCT GCTAATCCTG 
TTACCAGTGG CTGCTGCCAG 

TGAAGTTCTT GAGACATCGT GGCGGATGTA TGGAGCGAGA CGATTAGGAC 
AATGGTCACC GACGACGGTC 

42 01 TGGCGATAAG TCGTGTCTTA CCGGGTTGGA CTCAAGACGA TAGTTACCGG 
ATAAGGCGCA GCGGTCGGGC 

ACCGCTATTC AGCACAGAAT GGCCCAACCT GAGTTCTGCT ATCAATGGCC 
TATTCCGCGT CGCCAGGCCG 

4271 TGAACGGGGG GTTCGTGCAC ACAGCCCAGC TTGGAGCGAA CGACCTACAC 
CGAAC TGAGA TACCTACAGC 

ACTTGCCCCC CAAGCACGTG TGTCGGGTCG AACCTCGCTT GCTGGATGTG 
GCTTGACTCT ATGGATGTCG 

4341 GTGAGCTATG AGAAAGCGCC ACGCTTCCCG AAGGGAGAAA GGCGGACAGG 
TATCCGGTAA GCGGCAGGGT 

CACTCGATAC TCTTTCGCGG TGCGAAGGGC TTCCCTCTTT CCGCCTGTCC 
ATAGGCCATT CGCCGTCCCA 

4411 CGGAACAGGA GAGCGCACGA GGGAGCTTCC AGGGGGAAAC GCCTGGTATC 
TTTATAGTCC TGTCGGGTTT 

GCCTTGTCCT CTCGCGTGCT CCCTCGAAGG TCCCCCTTTG CGGACCATAG 
AAATATCAGG ACAGCCCAAA 

4481 CGCCACCTCT GACTTGAGCG TCGATTTTTG TGATGCTCGT CAGGGGGGCG 
GAGCCTATGG AAAAACGC CA 

GCGGTGGAGA CTGAACTCGC AGCTAAAAAC ACTACGAGCA GTCCCCCCGC 
CTCGGATACC TTTTTGCGGT 




4551 GCAACGCGGC CTTTTTACGG TTCCTGGCCT TTTGCTGGCC TTTTGCTCAC 
ATGTTCTTTC CTGCGTTATC 

CGTTGCGCCG GAAAAATGCC AAGGACCGGA AAACGACCGG AAAACGAGTG 
TACAAGAAAG GACGCAATAG 

4621 CCCTGATTCT GTGGATAACC GTATTACCGC CTTTGAGTGA GCTGATACCG 
CTCGCCGCAG CCGAACGACC 

GGGAC TAAGA CACCTATTGG CATAATGGCG GAAACTCACT CGACTATGGC 
GAGCGGCGTC GGCTTGCTGG 

4 691 GAGCGCAGCG AGTCAGTGAG CGAGGAAGCG GAAGAGCGCC CAATACGCAA 
ACCGCCTCTC CCCGCGCGTT 

CTCGCGTCGC TCAGTCACTC GCTCCTTCGC CTTCTCGCGG GTTATGCGTT 

TGGCGGAGAG GGGCGCGCAA 

4761 GGCCGATTCA TTAATGCAGC TGGCACGACA GGTTTCCCGA CTGGAAAGCG 
GGCAGTGAGC GCAACGCAAT 

CCGGCTAAGT AATTACGTCG ACCGTGCTGT CCAAAGGGCT GACCTTTCGC 
CCGTCACTCG CGTTGCGTTA 

4831 TAATGTGAGT TAGCTCACTC ATTAGGCACC CCAGGCTTTA CACTTTATGC 
TTCCGGCTCG TATGTTGTGT 

ATTACACTCA ATCGAGTGAG TAATCCGTGG GGTCCGAAAT GTGAAATACG 
AAGGCCGAGC ATACAACACA 



4 901 GGAATTGTGA GCGGATAACA ATTTCACACA GGAAACAGCT ATGAC CATGA 
TTACGCCAAG CGCGCAATTA 

CCTTAACACT CGCCTATTGT TAAAGTGTGT CCTTTGTCGA TACTGGTACT 
AATGCGGTTC GCGCGTTAAT 



BSSHII 



NCOI 



Kpnl 



4 971 ACCCTCACTA AAGGGAACAA AAGCTGGGTA C 
TGGGAGTGAT TTCCCTTGTT TTCGACCCAT G 




Figure j£l Pr 




Figure H1& 



Ncol 



Hindlll 

M AHA RVL LLAL A V L ATA AVAV 
AAGCTTACCA TGGCCCACGC CCGCGTCCTC CTCCTGGCGC TCGCCGTGCT GGCCACGGCC GCCGTCGCCG 

NotI 



ASS SSF ADSN P I R PVT DRAA AST 
71 TCGCCTCCTC CTCCTCCTTC GCCGACTCCA ACCCGATCCG GCCCGTCACC GACCGCGCGG CCGCCTCCAC 
- Q G I SEDL YSR L V E MATI SQA AYA 
141 GCAGGGCATC TCCGAAGACC TCTACAGCCG TTTAGTCGAA ATGGCCACTA TCTCCCAAGC TGCCTACGCC 

Sail 

ACCl 

D L C N IPS Til KGEK I Y N SQT DING 
211 GACCTGTGCA ACATTCCGTC GACTATTATC AAGGGAGAGA AAATTTACAA TTCTCAAACT GACATTAACG 
BamHI 



. W I L RDD SSKE I I T VFR GTGS D T N • 
281 GATGGATCCT CCGCGACGAC AGCAGCAAAG AAATAATCAC CGTCTTCCGT GGCACTGGTA GTGATACGAA 

-LQL DTNY TLT PFD TLPQ CNG CEV 
351 TCTACAACTC GATACTAACT ACACCCTCAC GCCTTTCGAC ACCCTACCAC AATGCAACGG TTGTGAAGTA 

HGGY YIG WVS VQDQ VES LVK QQVS 
421 CACGGTGGAT ATTATATTGG ATGGGTCTCC GTCCAGGACC AAGTCGAGTC GCTTGTCAAA CAGCAGGTTA 

• QYP DYA LTVT GHX LGA SLAA L T A • 
4 91 GCCAGTATCC GGACTACGCG CTGACCGTGA CCGGCCACKC CCTCGGCGCC TCCCTGGCGG CACTCACTGC 

-AQL SATY DNI RLY TFGE PRS GNQ 
561 CGCCCAGCTG TCTGCGACAT ACGACAACAT CCGCCTGTAC ACCTTCGGCG AACCGCGCAG CGGCAATCAG 

Xhol 



AFAS YMN DAF QASS PDT TQY FRVT 
631 GCCTTCGCGT CGTACATGAA CGATGCCTTC CAAGCCTCGA GCCCAGATAC GACGCAGTAT TTCCGGGTCA 

Ncol 



. HAN DGI PNLP PVE QGY AHGG VEY- 
701 CTCATGCCAA CGACGGCATC CCAAACCTGC CCCCGGTGGA GCAGGGGTAC GCCCATGGCG GTGTAGAGTA 

. W S V DPYS AQN TFV CTGD EVQ CCE 
771 CTGGAGCGTT GATCCTTACA GCGCCCAGAA CACATTTGTC TGCACTGGGG ATGAAGTGCA GTGCTGTGAG 

AQGG QGV NNA HTTY FGM TSG ACTW 
841 GCCCAGGGCG GACAGGGTGT GAATAATGCG CACACGACTT ATTTTGGGAT GACGAGCGGA GCCTGTACAT 
* 

911 GGTGATCAGT CATTTCAGCC TCCCCGAGTG TACCAGGAAA GATGGATGTC CTGGAGAGGG GGCCGCGTAA 
981 CCACTGAAGG ATGAGCTGTA AAGAAGCAGA TCGTTCAAAC ATTTGGCAAT AAAGTTTCTT AAGATTGAAT 
1051 CCTGTTGCCG GTCTTGCGAT GATTATCATA TAATTTCTGT TGAATTACGT TAAGCATGTA ATAATTAACA 
1121 TGTAATGCAT GACGTTATTT ATGAGATGGG TTTTTATGAT TAGAGTCCCG CAATTATACA TTTAATACGC 

Clal 

1191 GATAGAAAAC AAAATATAGC GCGCAAACTA GGATAAATTA TCGCGCGCGG TGTCATCTAT GTTACTAGAT 
Hindi I I 



Clal Xbal 



12 61 CGATAAGCTT CTAGAGCGGC CGGTGGAGCT CCAATTCGCC CTATAGTGAG TCGTATTACG CGCGCTCACT 

13 31 GGCCGTCGTT TTACAACGTC GTGACTGGGA AAACCCTGGC GTTACCCAAC TTAATCGCCT TGCAGCACAT 

14 01 CCCCCTTTCG CCAGCTGGCG TAATAGCGAA GAGGCCCGCA CCGATCGCCC TTCCCAACAG TTGCGCAGCC 
14 71 TGAATGGCGA ATGGGACGCG CCCTGTAGCG GCGCATTAAG CGCGGCGGGT GTGGTGGTTA CGCGCAGCGT 
1541 GACCGCTACA CTTGCCAGCG CCCTAGCGCC CGCTCCTTTC GCTTTCTTCC CTTCCTTTCT CGCCACGTTC 
1611 GCCGGCTTTC CCCGTCAAGC TCTAAATCGG GGGCTCCCTT TAGGGTTCCG ATTTAGTGCT TTACGGCACC 
1681 TCGACCCCAA AAAACTTGAT TAGGGTGATG GTTCACGTAG TGGGCCATCG C C C T GAT AG A CGGTTTTTCG 



1751 CCCTTTGACG TTGGAGTCCA CGTTCTTTAA TAGTGGACTC TTGTTCCAAA CTGGAACAAC ACTCAACGCT 

1821 ATCTCGGTCT ATTCTTTTGA TTTATAAGGG ATTTTGCCGA TTTCGGCCTA TTGGTTAAAA AATGAGCTGA 

18 91 TTTAACAAAA ATTTAACGCG AATTTTAACA AAATATTAAC GCTTACAATT TAGGTGGCAC TTTTCGGGGA 

1961 AATGTGCGCG GAACCCCTAT TTGTTTATTT TTCTAAATAC ATTCAAATAT GTATCCGCTC ATGAGACAAT 

2 031 AACCCTGATA AATGCTTCAA TAATATTGAA AAAGGAAGAG TATGAGTATT CAACATTTCC GTGTCGCCCT 

2101 TATTCCCTTT TTTGCGGCAT TTTGCCTTCC TGTTTTTGCT CACCCAGAAA CGCTGGTGAA AGTAAAAGAT 

2171 GCTGAAGATC AGTTGGGTGC ACGAGTGGGT TACATCGAAC TGGATCTCAA CAG CGGTAAG ATCCTTGAGA 

2241 GTTTTCGCCC CGAAGAACGT TTTCCAATGA TGAGCACTTT TAAAGTTCTG CTATGTGGCG CGGTATTATC 

2311 CCGTATTGAC GCCGGGCAAG AGCAACTCGG TCGCCGCATA CACTATTCTC AGAATGACTT GGTTGAGTAC 

2 3 81 TCACCAGTCA CAGAAAAGCA TCTTACGGAT GGCATGACAG TAAGAGAATT ATGCAGTGCT GCCATAACCA 

2451 TGAGTGATAA CACTGCGGCC AACTTACTTC TGACAACGAT CGGAGGACCG AAGGAGCTAA CCGCTTTTTT 

2521 GCACAACATG GGGGATCATG TAACTCGCCT TGATCGTTGG GAACCGGAGC TGAATGAAGC CATACCAAAC 

2591 GACGAGCGTG ACACCACGAT GCCTGTAGCA ATGGCAACAA CGTTGCGCAA ACTATTAACT GGCGAACTAC 

2 661 TTACTCTAGC TTCCCGGCAA CAATTAATAG ACTGGATGGA GGCGGATAAA GTTGCAGGAC CACTTCTGCG 

2 731 CTCGGCCCTT CCGGCTGGCT GGTTTATTGC TGATAAATCT GGAGCCGGTG AGCGTGGGTC TCGCGGTATC 

2 801 ATTGCAGCAC TGGGGCCAGA TGGTAAGCCC TCCCGTATCG TAGTTATCTA CACGACGGGG AGTCAGGCAA 

2 8 71 CTATGGATGA ACGAAATAGA CAGATCGCTG AGATAGGTGC CTCACTGATT AAGCATTGGT AACTGTCAGA 

2 941 CCAAGTTTAC TCATATATAC TTTAGATTGA TTTAAAACTT CATTTTTAAT TTAAAAGGAT CTAGGTGAAG 

3 011 ATCCTTTTTG ATAATCTCAT GACCAAAATC CCTTAACGTG AGTTTTCGTT CCACTGAGCG TCAGACCCCG 
3 0 81 TAGAAAAGAT CAAAGGATCT TCTTGAGATC CTTTTTTTCT GCGCGTAATC TGCTGCTTGC AAACAAAAAA 
3151 ACCACCGCTA CCAGCGGTGG TTTGTTTGCC GGATCAAGAG CTACCAACTC TTTTTCCGAA GGTAACTGGC 
3221 TTCAGCAGAG CGCAGATACC AAATACTGTC CTTCTAGTGT AGCCGTAGTT AGGCCACCAC TTCAAGAACT 
32 91 CTGTAGCACC GCCTACATAC CTCGCTCTGC TAATCCTGTT ACCAGTGGCT GCTGCCAGTG GCGATAAGTC 
3 3 61 GTGTCTTACC GGGTTGGACT CAAGACGATA GTTACCGGAT AAGGCGCAGC GGTCGGGCTG AACGGGGGGT 

34 31 TCGTGCACAC AGCCCAGCTT GGAGCGAACG ACCTACACCG AACTGAGATA CCTACAGCGT GAGCTATGAG 

35 01 AAAGCGCCAC GCTTCCCGAA GGGAGAAAGG C GGACAGG T A TC CGGTAAG C GGCAGGGTCG GAACAGGAGA 
3571 GCGCACGAGG GAGCTTCCAG GGGGAAACGC CTGGTATCTT TATAGTCCTG TCGGGTTTCG CCACCTCTGA 
3 641 CTTGAGCGTC GATTTTTGTG ATGCTCGTCA GGGGGGCGGA GCCTATGGAA AAACGCCAGC AACGCGGCCT 
3 711 TTTTACGGTT CCTGGCCTTT TGCTGGCCTT TTGCTCACAT GTTCTTTCCT GCGTTATCCC CTGATTCTGT 
3 7 81 GGATAACCGT ATTACCGCCT TTGAGTGAGC TGATACCGCT CGCCGCAGCC GAACGACCGA GCGCAGCGAG 
3 851 TCAGTGAGCG AGGAAGCGGA AGAGCGCCCA ATACGCAAAC CGCCTCTCCC CGCGCGTTGG CCGATTCATT 
3 921 AATGCAGCTG GCACGACAGG TTTCCCGACT GGAAAGCGGG CAGTGAGCGC AACGCAATTA ATGTGAGTTA 

3 991 GCTCACTCAT TAGGCACCCC AGGCTTTACA CTTTATGCTT CCGGCTCGTA TGTTGTGTGG AATTGTGAGC 

4 061 GGATAACAAT TTCACACAGG AAACAGCTAT GACCATGATT ACGCCAAGCG CGCAATTAAC CCTCACTAAA 

Kpnl Xhol 

4131 GGGAACAAAA GCTGGGTACC GGGCCCCCCC TCGAGGTCAT TCATATGCTT GAGAAGAGAG TCGGGATAGT 

4201 CCAAAATAAA ACAAAGGTAA GATTACCTGG TCAAAAGTGA AAACATCAGT TAAAAGGTGG TATAAGTAAA 

4271 ATATCGGTAA TAAAAGGTGG CCCAAAGTGA AATTTACTCT TTTCTACTAT TATAAAAATT GAGGATGTTT 

4341 TGTCGGTACT TTGATACGTC ATTTTTGTAT GAATTGGTTT TTAAGTTTAT TCGCGATTTG GAAATGCATA 
TCTGTATTTG AGTCGGTTTT TAAGTTCGTT GCTTTTGTAA ATACAGAGGG ATTTGTATAA GAAATATCTT 

ECORI 



4411 



44 81 TAAAAAACCC ATATGCTAAT TTGACATAAT TTTTGAGAAA AATATATATT CAGGCGAATT CCACAATGAA 
4551 CAATAATAAG ATTAAAATAG CTTGCCCCCG TTGCAGCGAT GGGTATTTTT TCTAGTAAAA TAAAAGATAA 
4621 ACTTAGACTC AAAACATTTA CAAAAACAAC CCCTAAAGTC CTAAAGCCCA AAGTGCTATG CACGATCCAT 
46 91 AGCAAGC C C A GCCCAACCCA ACCCAACCCA ACCCACCCCA GTGCAGCCAA CTGGCAAATA GTCTCCACCC 
4 761 CCGGCACTAT CACCGTGAGT TGTCCGCACC ACCGCACGTC TCGCAGCCAA AAAAAAAAAA AGAAAGAAAA 
4 831 AAAAGAAAAA GAAAAACAGC AGGTGGGTCC GGGTCGTGGG GGCCGGAAAA GCGAGGAGGA TCGCGAGCAG 
4 901 CGACGAGGCC CGGCCCTCCC TCCGCTTCCA AAGAAACGCC CCCCATCGCC ACTATATACA TACCCCCCCC 
4 971 TCTCCTCCCA TCCCCCCAAC CCTACCACCA CCACCACCAC CACCTCCTCC CCCCTCGCTG CCGGACGACG 
5041 AGCTCCTCCC CCCTCCCCCT CCGCCGCCGC CGGTAACCAC CCCGCCCCTC TCCTCTTTCT TTCTCCGTTT 
TTTTTTTCGT CTCGGTCTCG ATCTTTGGCC TTGGTAGTTT GGGTGGGCGA GAGCGGCTTC GTCGCCCAGA 

BamHI 



5111 



5181 TCGGTGCGCG GGAGGGGCGG GATCTCGCGG CTGGCGTCTC CGGGCGTGAG TCGGCCCGGA TCCTCGCGGG 

Bglll 



5251 
5321 



GAATGGGGCT CTCGGATGTA GATCTTCTTT CTTTCTTCTT TTTGTGGTAG AATTTGAATC CCTCAGCATT 
GTTCATCGGT AGTTTTTCTT TTCATGATTT GTGACAAATG CAGCCTCGTG CGGAGCTTTT TTGTAGC 



Figure £<DA 



Actin promoter -FAEs 



Bam HI (1 09 5) 
fig /II (1 1 27) 

pCOR deletion 

A7ioI(17) Htn d 10 (1248) 

Kpnl(6) £Yt>Rl(3 93) F U L L K p tl 1 -E C O R I deletion /VcoI(l 255 ) 




ACTIN -PROM OTER-FAEs 

1259 tp 



Kpnl Xhol 



1 


GGTACCGGGC 
CCATGGCCCG 


CCCCCCTCGA 
GGGGGGAGCT 


GGTCATTCAT 
CCAGTAAGTA 


ATGCTTGAGA 
TACGAACTCT 


AGAGAGTCGG 
TCTCTCAGCC 


GATAGTCCAA 
CTATCAGGTT 


AATAAAACAA 
TTATTTTGTT 


71 


AGGTAAGATT 

1 L^rll 1 Uliiii 


ACCTGGTCAA 

1 1 1 


AAGTGAAAAC 

I ± L^fH- 1 1 1 lb 


ATCAGTTAAA 

1 H\3 l 111 


AGGTGGTATA 

1 ^wfi.1 ri. 1 


AGTAAAATAT 

1 \ — ri I 1 1 1-tt.lri 


CGGTAATAAA 

bL-L-ill Iril i 1 


141 


AGGTGGCCCA 
TCCACCGGGT 


AAGTGAAATT 
TTCACTTTAA 


TACTCTTTTC 
ATGAGAAAAG 


TACTATTATA 
ATGATAATAT 


AAAATTGAGG 
TTTTAACTCC 


ATGTTTTGTC 
TACAAAACAG 


GGTACTTTGA 
CCATGAAACT 


211 


TACGTCATTT 
ATGCAGTAAA 


TTGTATGAAT 
AACATACTTA 


TGGTTTTTAA 
ACCAAAAATT 


GTTTATTCGC 
CAAATAAGCG 


GATTTGGAAA 
CTAAACCTTT 


TGCATATCTG 
ACGTATAGAC 


TATTTGAGTC 
ATAAACTCAG 


281 


GGTTTTTAAG 
CCAAAAATTC 


TTCGTTGCTT 
AAGCAACGAA 


TTGTAAATAC 
AACATTTATG 


AGAGGGATTT 
TCTCCCTAAA 


GTATAAGAAA 
CATATTCTTT 

EcoRI 


TATCTTTAAA 
ATAGAAATTT 


AAACCCATAT 
TTTGGGTATA 


351 


GCTAATTTGA 
CGATTAAACT 


CATAATTTTT 
GTATTAAAAA 


GAGAAAAATA 
CTCTTTTTAT 


TATATTCAGG 
ATATAAGTCC 


CGAATTCCAC 
GCTTAAGGTG 


AATGAACAAT 
TTACTTGTTA 


AATAAGATTA 
TTATTCTAAT 


421 


AAATAGCTTG 
TTTATCGAAC 


CCCCCGTTGC 
GGGGGCAACG 


AGCGATGGGT 
TCGCTACCCA 


ATTTTTTCTA 
TAAAAAAGAT 


GTAAAATAAA 
CATTTTATTT 


AGATAAACTT 
TCTATTTGAA 


AGACTCAAAA 
TCTGAGTTTT 


491 


CATTTACAAA 
GTAAATGTTT 


AACAACCCCT 
TTGTTGGGGA 


AAAGTCCTAA 
TTTCAGGATT 


AGCCCAAAGT 
TCGGGTTTCA 


GCTATGCACG 
CGATACGTGC 


ATCCATAGCA 
TAGGTATCGT 


AGCCCAGCCC 
TCGGGTCGGG 


561 


AACCCAACCC 
TTGGGTTGGG 


AACCCAACCC 
TTGGGTTGGG 


ACCCCAGTGC 
TGGGGTCACG 


AGCCAACTGG 
TCGGTTGACC 


CAAATAGTCT 
GTTTATCAGA 


CCACCCCCGG 
GGTGGGGGCC 


CACTATCACC 
GTGATAGTGG 


631 


GTGAGTTGTC 
CACTCAACAG 


CGCACCACCG 
GCGTGGTGGC 


CACGTCTCGC 
GTGCAGAGCG 


AGCCAAAAAA 
TCGGTTTTTT 


AAAAAAAGAA 
TTTTTTTCTT 


AGAAAAAAAA 
TCTTTTTTTT 


GAAAAAGAAA 
CTTTTTCTTT 


701 


AACAGCAGGT 
TTGTCGTCCA 


GGGTCCGGGT 
CCCAGGCCCA 


CGTGGGGGCC 
GCACCCCCGG 


GGAAAAGCGA 
CCTTTTCGCT 


GGAGGATCGC 
CCTCCTAGCG 


GAGCAGCGAC 
CTCGTCGCTG 


GAGGCCCGGC 
CTCCGGGCCG 



Figure SV B 



771 CCTCCCTCCG CTTCCAAAGA AACGCCCCGC ATCGCCACTA TATACATACC CCCCCCTCTC CTCCCATCCC 
GGAGGGAGGC GAAGGTTTCT TTGCGGGGGG TAGCGGTGAT ATATGTATGG GGGGGGAGAG GAGGGTAGGG 

841 CCCAACCCTA CCACCACCAC CACCACCACC TCCTCCCCCC TCGCTGCCGG ACGACGAGCT CCTCCCCCCT 
GGGTTGGGAT GGTGGTGGTG GTGGTGGTGG AGGAGGGGGG AGCGACGGCC TGCTGCTCGA GGAGGGGGGA 

911 CCCCCTCCGC CGCCGCCGGT AACCACCCCG CCCCTCTCCT CTTTCTTTCT CCGTTTTTTT TTTCGTCTCG 
GGGGGAGGCG GCGGCGGCCA TTGGTGGGGC GGGGAGAGGA GAAAGAAAGA GGCAAAAAAA AAAGCAGAGC 

981 GTCTCGATCT TTGGCCTTGG TAGTTTGGGT GGGCGAGAGC GGCTTCGTCG CCCAGATCGG TGCGCGGGAG 
CAGAGCTAGA AACCGGAACC ATCAAACCCA CCCGCTCTCG CCGAAGCAGC GGGTCTAGCC ACGCGCCCTC 

BamHI 



1051 GGGCGGGATC TCGCGGCTGG CGTCTCCGGG CGTGAGTCGG CCCGGATCCT CGCGGGGAAT GGGGCTCTCG 
CCCGCCCTAG AGCGCCGACC GCAGAGGCCC GCACTCAGCC GGGCCTAGGA GCGCCCCTTA CCCCGAGAGC 

Bglll 



1121 GATGTAGATC TTCTTTCTTT CTTCTTTTTG TGGTAGAATT TGAATCCCTC AGCATTGTTC ATCGGTAGTT 
CTACATCTAG AAGAAAGAAA GAAGAAAAAC ACCATCTTAA ACTTAGGGAG TCGTAACAAG TAGCCATCAA 

Hindi II Ncol 



1191 TTTCTTTTCA TGATTTGTGA CAAATGCAGC CTCGTGCGGA GCTTTTTTGT AGGTAGAAGC TTACCATGG 
AAAGAAAAGT AC T AAAC ACT GTTTACGTCG GAGCACGCCT CGAAAAAACA TCCATCTTCG AATGGTACC 



Kpnl-EcoRl - deletion underlined and restored NCO site in bold in vectors pJQ4.9 5 pJQ3.2 
andpJ06.3. 



Figure £j_ 



ALEURAINdeJeted NPIR (Apoplast) structure and sequence 



Nco I (9) 
Hw dill (2) 



Noil (8 7) 



ALEURAIN-NPIR-DEL 



+ 1 M AHA RVL LLAL A V L ATA A V A 

Hindlll Ncol 



1 AAGCTTACCA TGGCCCACGC CCGCGTCCTC CTCCTGGCGC TCGCCGTGCT GGCCACGGCC GCCGTCGCCG 
TTCGAATGGT ACCGGGTGCG GGCGCAGGAG GAGGACCGCG AGCGGCACGA CCGGTGCCGG CGGCAGCGGC 

+ 1VASS R A A 
Not I 



71 TCGCCTCCTC CCGCGCGGCC GCC 
AGCGGAGGAG GGCGCGCCGG CGG 



Figure C^x 



SEE1 ( Senescence enhanced ) PROMOTER sequence 

1 CATGGGCCAG GTATAATTAT GGGATATCTC AAGCAAATAA TCGAAATATC ACCATTGGCT ACAATATCTG 

PstI Xbal Xbal 



71 AGCTCCGAGT TGTGACTGCA GTCTGGATGA CGCGTGTTGT ATCTAGAACT CTAGATAGCA CAGCCACAGC 
141 ACCTACAGGA GTGCGACACT TGTGGACTGT AGTAGTGTTG GAGACGGAGC TCTTTCCTAC CTCCTGACGT 
211 TGCCGCCGTT GTCCATTCCA ACGGCATCAC TCTCAACCAA TCACGCGCTC CCAACAAAAT ATCGTCCCCC 

2 81 ATGTCTTGGC GGAGAGAGAG TACATACATG CTGTCGCGCC GTTTTTGTCT GAATCTCGCT TCCACTGGCC 

Smal 

3 51 AATCAGCTCA GCTCCCGGGA GCTCACTCAT TCAAGATCCC ATCGTCGTCG TCACCCCTGG CGTCATGGGA 
421 TGGAAAAGAA CCTCCGTTGC TCGGATGAGT CAGCCATATC CCCGAACAGA GTACTGCAAG ATAACCCAAT 

SphI 



4 91 TCAGATTCCC CCAATAGAGA AAGTATAGCA TGCTTTCGGG TTTTGTTTGG CTTAATTGAC TTTATTTTTG 
561 TTGGAGTTGA ATGCTGATTT GTTGTGTAAA ATGCCCAACC ATCTGAATAT CGAGACGGAT AATAGGCTGG 
631 CTAATTAATT TATAGCAAGA TTCTGTAGTG CACATCGCAA ATATCTTTCT GGGCATTACA GCTGGAGGCT 

PstI 



701 TCATCAGCCT GAAACACTCT GCAGAGCCTG AAGCAAGTGG TGAAGCGTGG CGATGAGATG GGTATAAAAC 
7 71 CCCCGGCACC GGGACGCGAG CTCCCGCCTA CCAGTACCAT CTCGCCTCGC TCCCCCTGCC GGACGACCCA 
841 GTAAAATACT GTTGCCCACT CGCCGGCGAG ATG 



Figure 



SEE1 ( Senescence enhanced ) PROMOTER plus vacuolar aleurain SIGNAL/NPIR sequence 

1 CATGGGCCAG GTATAATTAT GGGATATCTC AAGCAAATAA TCGAAATATC ACCATTGGCT ACAATATCTG 

PstI Xbal Xbal 



71 AGCTCCGAGT TCTGACTGCA GTCTGGATGA CGCGTGTTGT ATCTAGAACT CTAGATAGCA CAGCCACAGC 
141 AC CTACAGGA GTGCGACACT TGTGGACTGT AGTAGTGTTG GAGACGGAGC TCTTTCCTAC CTCCTGACGT 
211 TGCCGCCGTT GTCCATTCCA ACGGCATCAC TCTCAACCAA TCACGCGCTC CCAACAAAAT ATCGTCCCCC 

2 81 ATGTCTTGGC GGAGAGAGAG TACATACATG CTGTCGCGCC GTTTTTGTCT GAATCTCGCT TCCACTGGCC 

Smal 

3 51 AATGAGCTCA GCTCCCGGGA GCTCACTCAT TCAAGATCCC ATCGTCGTCG TCACCCCTGG CGTCATGGGA 
421 TGGAAAAGAA CCTCCGTTGC TCGGATGAGT CAGCCATATC CCCGAACAGA GTACTGCAAG ATAACCCAAT 

SphI 



4 91 TCAGATTCCC C C AAT AG AG A AAGTATAGCA TGCTTTCGGG TTTTGTTTGG CTTAATTGAC TTTATTTTTG 
561 TTGGAGTTGA ATGCTGATTT GTTGTGTAAA ATGCCCAACC ATCTGAATAT CGAGACGGAT AATAGGCTGG 
631 CTAATTAATT TATAGCAAGA TTCTGTAGTG CACATCGCAA ATATCTTTCT GGGCATTACA GCTGGAGGCT 

PstI 



701 TCATCAGCCT GAAACACTCT GCAGAGCCTG AAGCAAGTGG TGAAGCGTGG CGATGAGATG GGTATAAAAC 
7 71 CCCCGGCACC GGGACGCGAG CTCCCGCCTA CCAGTACCAT CTCGCCTCGC TCCCCCTGCC GGACGACCCA 

MAHG R I L FLA LAVL 
841 GTAAAATACT GTTGCCCACT CGCCGGCGAG ATGGCCCACG GCCGCATCCT CTTCTTGGCG CTCGCCGTCT 

BssHII 

Not I 

* ATA A V A AASL ADS NPI RPVT ERA* 
911 TGGCCACCGC CGCGGTGGCC GCCGCATCNT TGGCGGACTC CAACCCGATC CGGCCCGTCA CCGAGCGCGC 
Not I 



981 



- A A 

GGCCGCC 



